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ABSTRACT 

Thi«^ -port summarizes a series of e^ght research 
studies related tc the use of computer-assisted instruction (CAI) 
with mildly handicapped students at the junior high or high school 
level. Through videodisc cmd CAI the studies isolated the effects of 
the following design variables: (1) review cycles; (2) size of 
teaching sets; (3) explicit strategies; and (4) correction 
procedures. Studies involved three different kinds of CAI — drill and 
practice, tutorials, and simulations* Results indicated that properly 
designed CAI can be effective as an instructional medium if attention 
is paid to the academic task, the stage of instruction, and the role 
of the teacher. Deted.led reports are presented in the form of 
preprinti> or reprints of journal articles with the following titles: 
••Applying Instructional Design Principles to CAI for Mildly 
Handicapped Students: Four Recently Co .ducted Studies^^ (John Woodward 
et al.); ••Effects of Instructional Design Variables on Vocabulary 
Acquisition of LD Students: A Study of Computer-Assisted Instruction** 
(Gary Johnson et al.); ••Elaborated Corrective Feedback and the 
Acqu-^sition of Reasoning Skills: A Study of Computer-Assisted 
Instruction** (Maria Collins et al.); ••Teaching Problem Solving 
through Computer Simulations*^ (John Woodward et al.); ••The 
Effectiveness of Videodisc Instruction in Teaching Fractions to 
Learning-Disabled and Remedial High School Student s^^ (Bernadette 
Kelly et al.); and ••Closing the Performance Gap in Secondary 
Education •• (John Woodward et al.). References accompany each paper. 
(DB) 
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SlJr«ARY fF FINAL REPCOT 

This project included a series of ei^t researdi studies related to the use of 
conputer assisted instruction (CAI) with mildly handicapped students, either 
at the junior high or higfi sdxx>l level. Tliese studies were designed to examine 
a range of irstructional design principles that have been previously denpnstrated 
to be effective techniques in non-conputer studies and the appliciability of these 
for jjtproving the effectiveness of different types of CAI programs and the 
videodisc. Through videodisc and corputer assisted instruction, project staff 
were able to isolate the effects of the following instructional design variables: 
(1) review qycles, (2) size of teaching sets^ (3) explicit strategies, and 
(4) correction procedures. 

Several of the studies in the project involved three different kinds of 
CAI — drill and practice, tutorials, and simulations. Thus, a secondary 
interest of these studi*»o was to examine the iirpact of different design variables 
across a range of CAI program types. The results of these studies indicate 
that properly designed CAI can be effective as an instructional medium. 
For exanple, the Vocabulary Instruction study demonstrated that a skill requiring 
considerable practice can be adequately taught on a corputer , and the Reasoning 
Skills program was successful at teaching a more corplicated academic 
task — logical, inferences. The Math Word Problems study, however, appeared to 
indicate that the best way to teach skills in this area may be tnrou^ teacher 
directed instruction firsts with the corputer used for guided practice. 
The specific outcomes of the studies in this project lead to a better under- 
standing of the ^pJication of technology in special education as it is linked 
to better instructional design principles. Another implicit outcome of this 
project is the need for careful consideration of the academic task, the stage 
of instruction, and the role of the teadier in order to make optimal use of 
computv^r assisted instruction. 
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Applications of computer technology in special education have all too often 
come about because of enthusiasm over the hardware and selected software 
programs. While most advocates in the field are adept at detailing technical 
capabilities of this medium, little has been done in the way of systematic empirical 
research into the use of computer assisted instruction (CAI) for the mildly 
handicapped. This report summarizes eight studies conducted in this area over the 
last three years. 

Each study is grounded in the instructional design principles articulated by the 
senior project director on this grant (Engelmann & Gamine, 1982) and others in the 
field. These principles have been empirically demonstrated as effective techniques in 
non-computer studies and in fifteen years of research on effective curricuium design 
variables as part of Project Follow Through. The studies described in this involved 
the three different kinds of CAI (i.e., drill and practice, tutorials, and simulations) as 
well as videodisc instruction. Three of the eight studies are refinements of the CAI 
studies conducted during the first year of this grant. In each case, we attempted to 
build upon the earlier study, either by investigating more specific hypotheses or by 
testing improved versions of the software program. 

As a program of research, these studies support our initial hyposthesis, namely 
that empirically-based instructional design principles are applicable for Improving the 
effectivness different types of CAI programs and the videodisc media. Furthermcs, 
our research enabled us to begin to make some recommendations about teacher 
practices in relation to tne use of educational technology. 

Research Problem 

The instructional design piinciples investigated under this grant have been 
articulated by Engelmann and Gamine's (1982) Thenrv nf instruction. These 
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principles have been empiricaiiy demonstrated as effective techniques in non- 
computer studies (e.g., Gamine, 1980; Gamine, Kameenui, & Woolfson, 1982; Darch, 
Gamine, & Gersten, 1984} and in our fifteen years of experience with Project Follow 
Through (cf. Stebbins et al.. 1977). 

Through videodisc and computer assisted instruction, we have been able to 
isolate the effects of the following instructional design variables: a) review cycles, b) 
size of teaching sets, c) explicit strategies, and d) correction procedures. We did this 
in a variety of ways. Two of the studies compared software that we developed to 
popular commercial GAI programs. In two other studies, we compared variants the 
same software program, either by using different approaches or by using different 
versions of the same program. Two additional studies compared the use of 
technology (i-e., GAI and videodisc) to traditional teaching practices. Rgure 1 shows 
the relationship of the different instructional design variables to the eight studies. 

Rgure 1 
Instructional Design Variables 
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Research Methods 

All of the studies described below were conducted with mildly handicapped 
secondary students, either at the junior high or secondary level. All were selected 
from special education resource programs and screened for skill deficits in the 
targeted academic area For example, all students in the math word problems study 
were competent In basic arithmetic operations (through division) and knew how to 
solve addition and subtraction word problems. Thus, It was appropriate to teach 
these studetns word problems involving division and multiplication on the fucus of the 
study. Students whose skills were above or below this level were not used In the 
study; those who remained were randomly assigned to treatment conditions. Each 
study involved a group design, with random assignment of subjects to conditions. 
Finally, in order to precisely measure academic development, tests were created for 
the particular academic skills taught in each study. When appropriate, measures of 
knowledge transfer were also included. 

Findings 

The research findings have been or will soon be published in a variety of 
special education and technology journals. As each article fully documents the 
procedures used as well as the specific findings, we have appended a copy of each to 
this report. Except for the last two studies, which were completed in June of 1987 and 
thus remain unpublished at the time of this report, all six studies are described in the 
following six articles. We have also included abstracts of those two unpublished 
studies. For ease of reference, the table below indicates which studies are associated 
with the following articles. 
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Woodward, J., Camine, D., Gersten, R., Gleason, M., 
Johnson, G.. & Collir^s, M. (1986). Instructional 
design principles for CAl: A summary of four 
studies, ilniim?' ?^pft"a' gdueation Teohnoloav. 
J:{1), 107-118. 

Johnson, G., Gersten, R., & Camine, D. (1987). 
Effects of instructional design variables op 
vocabulary acquisition of LD students: A study 
of computer assisted instruction. 
J ournal of tea ming Disabilities. 20(4). 206-213. 

Collins, M., Camine, D. W., & Gersten, R. (in press). 
Elaborated connected feedback and the acquisition 
of reasoning skills. A study of computer-assisted 
instnjction. Excentional Children. 

Woodward, J., Carnine, D.,t& Gersten, R., (in press). 
Teaching problem-solving through computer 
simulations. American Educational Research Journal . 

Kelly, B., Camine, D. W., Gersten, R., & Grossen, B., 
(1986). The effectiveness of videodisc instruction 
in teaching fractions to learning handicapped & 
remedial high school students. Journal of Special 
Education TQchnoloov. 8(2). 5-17. 

Woodward, J., Carnine, 0., & Collins, M. Closing 
the perfonnance gap in secondary education. Article 
submitted to the Journal of Computer Research in 

The effect of an explicit strategy on student 
comprehension of problem solving skills 
taught in a computer health simulation. 
University of Oregon: abstract 

[Abstract indicated here war not included 
in document received by ERlJ.l 
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Health Problem Solving 



Vocabulary Instruction 



Reasoning Skills 



Health Problem Solving I 
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Vocabulary Instmction 
Reasoning Skills II 
Health Problem Solving I 
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Abstract 
Health Problem Solving II 

An experimental study was conducted to determine the effectiveness of 
teaching an explicit strategy to a group of special education students for solving a 
series of health problems using computer simulations. Students were matched 
according to reading ability and randomly assigned to either an experimental or 
control group. They were given a pretest to determine their knowledge of basic health 
concepts. 

During an intervention period of fourteen days, students in both groups studied 
health using a traditional form of health curricula and also a compute health 
simulation. The only difference between groups was the experimental students were 
taught a specific strategy to apply to the computer simulation. The control group 
received no such strategy. 

Detailed study was completed on student performance across six test 
simulations. The defined objective on the simulation was to make decisions that 
would help surpass the profile's given life expectancy. 

After the treatment period, four separate measures were given. The first test 
measured the students' knowledge of basic concepts taught. The other three 
measures examined students' ability to generalize knowledge gained in the 
simulations to new contexts. These included a series of videotaped health profiles 
and written profiles. 

Both groups scored significantly higher on the health concepts posttest than on 
the pretest. While playing the test simulations, the treatment group scored significantly 
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higher than the control in achieving the defined objealve. This difference appeared to 
be due to the strategy that the experimental students were taught. 

No significant differences were found on the three generalizations measures, 
with one exception. A significant difference was found on one of the videotape 
measures, indicating experimental S's were more likely to correctly identify health 
changes required. 

On the written generalization measure, students were expected to identify and 
prioritize health changes. Although able to identify the required and corresponding 
behavioral changes, they lacked skills needed to prioritize these changes. 

The results of an attitudlnal survey indicate students generally felt they had 
developed a strategy. They also strongly indicated they enjoyed studying health using 
the computer and the simulation; few wanted to continue studying health using 
traditional texts. 

Researchers: Hollingsworth, M., Woodward, J., & Gersten, R. 



Applying Instructional Design' Principles to CAI for Mildly Handicapped 
Students: Four Recently Conducted Studies 



John Woodward 
Doug Carnint 
Russell Qersten 
Mary Gleason 
Gary Johnson 
Maria Collins 



Journal of S ppHal g^iieatlon Tftchnologv. 7M^. 1986 



Research for these studies was sponsored under the Department of 
Education grant number G008400600. We would also like to thanlt 
IBM for loaning us the computer hardware that enabled us to conduct 
some of these studies. 
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special education has passed through a phase where computers have 
been vcidely embraced and uncritically adopted. The enthusiasm over 
computers and their potential impact on special education can be documented 
with little difficulty (e.g.. Budoff. Thormann, & Gras, 1984; Blaschke, 1985). While 
most advocates are adept at detailing the technical capabilities of this medium 
(e.g., immediate feedback, automatic scoring, Individualized instruction), little . 
has been done in the way of systematic research Into the use of computers • in 
particular, computer assisted instruction (CAi) • for the mildly handicapped. This 
report summarizes four studies recently conducted in this area. They are the 
beginnings of what we consider to be systematic research into CAI for the mildly 
handicapped. 

The limited research on the instructional effectiveness of CAI for 
handicapped and non-handicapped populations is complicated and often 
contradiutory. After a comprehensive search of the literature, Forman (1982) 
conclucied that achievement was rarely enhanced by CAI, even though students 
exhibited positive attitudes toward such instruction. When studies do show 
effects on learning (Edwards, Morth, Taylor, Weis, & Dusseklorp. 1975; Bums & 
Boseman, 1981 ), they are modest and isolated, far from the more generalized 
impact on thinking skills that educators and enthusiasts have long claimed would 
result from CAI (Bangert-Drowns, Kulik, & Kulik, 1985). We are not surprised by 
thess mixed findings, as little of the available software used In special education 
settings makes use of even the most rudinientary prindples of sound 
instructional design and effective teactiing (cf. Stevens & Rosenshine, 1981 ; 
Engelmann & Camine, 1982; Brophy & Good, 1984). 

In 1984, we began a sieries of CAI studies that examined different 
instructional design principles that have been articulated by Engelmann and 
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Gamine (1982) and others. These principles have been empirically 
demonstrated as effective techniques in non-computer studies (e.g.. Gamine. 
1980; Gamine Kameenui. & Woolfson. 1982; Darch. Carnine, & Gersten, 1984) 
and in our fifteen years of experience with Project Follow Through (cf. Stebbins 
et al.. 1 977). The studies described below involved the th;ae different k'nds of 
traditional GAI: drill and practice, tutorials, and simulations. Thus, a secondary 
interest of this research was to examine the impact of different instructional 
design variables across a range of GAI program types. 

Through computer assisted instruction, we have been able to isolate the 
effects review cycles, size of teaching sets, explicit strategies, and correction 
procedures. We were able to do this in a variety of ways. Two of our studies 
compamd software that we developed to popular commercial programs. In 
another, we examined the effect of one variable (a correctioii procedure) by 
modifying our version of the software. In the last study described in this report, 
we used our software as an adjunct to a written curriculum to teach specific 
problem solving skills. Rgure 1 shows the relationship of the different 
instructional design variables to the four studies. 

[Insert Rgure 1 about here] 

All of the studies desciibed below were conducted whh miklly 
handicapped secondary students. The two tutorial studies involved students 
from different junior high schools in a medium sized district. In each of the two 
remaining studies, the participating students were all from different class periods 
at the same high school. Students were mostly while and from middle and -)wer 
middis c\B9S families. Al! were selected from special education resource 
programs and screened for appropriate skill deficits before it was determined if 
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practice programs for teaching vocabulary to mildly handicapped adolescents 
(Johnson. Gersten, & Camine, 1986). The study examined the effect of size of 
the daily teaching sets and provisions for daily and cumulative review on the 
acquisition and maintenance of word meaning. Two CAI vocabulary programs 
were used to present the same 50 words and definitions. 

Two designs were used in this study: 1) a time to mastery (Will there be a 
significant difference between times required to meet mastery criterion on the 
50 words by students taught with the two different CAI programs?), and 2) fixed 
design, In which all subjects were tested after the seventh session. We also 
looked at differences between protest and posttest scores as well as 
maintenance of effects two weeks after students achieved mastery. 

Method 

Twenty-four mildly handicapped high school students from an Initial pool of 
38 students were matched by scores on a 50 item vocabulary pretest and 
randomly assigned to one of the two CAI programs. The students were all 
referred to the school's special education program for remedial reading or 
language arts instmction and were identified by the resource teacher as 
needing vocabul'ary instruction. Students worked individually on tl • .\jsigned 
program 20 minutes a day for 1 1 days. All of the words, which w^re the same 
for both programs, were considered important by two or more special education 
teachers. A final list composed of 25 verbs and 25 adjectives was -jsed. 

7hft HAI Programs. One program used in the study, the Small Teaching 
Set program, tests students on words and then creates lessons with the words 
they cannot identify (Camine, Rankin, & Granzin, 1984). After testing the 
students on new words, the program provides instmction on a "teaching set" of 
no more than three words which, the student misi ' .1 on the test. Each lesson 
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^so includes a -praclice sef with a maximum of seven words. The student 
™st meet a specilic mastery criterion on each word l»tore it is amoved from 
ttapracticeset The program tests the student on new wonjs and adds words 
tt»studentdoesnotl«»wlothepracticeset. Once the student has mastered 
ten words, the program presents a cumuiative review lesson on those words. 

The other program, the Teaching Set program, teaches words ,n - 
sets of 25 words (Davidson » Eckert. 1983). The student may choose to see the 
wontein anyoffourtypesofformats: (a) a toaching display which shows the 
word, its definition, and one example sentence: (b) a muftiple choice quiz 
format; (c) an exemise in which a definiUon is displayed and the student must 
span the correct missing wo-d to complete a sentence: and (d) an a«ade-type 
game in which the student matches words to their deSnitions. 

IjaasuifiS. A 50 item, multiple choice test was developed for the study (.79 
coetlicion.. alpha). This test was administered to all subjects as a pretest, as a 
cmerion reference test at the end of seven sessions, immediately after mastery 
(or at the end Of the eleventh session), and two weels after mastery. There 
were also two transfer measures. One was a 10 item objectiva test in which 
sttKlents defined words omIV. The other test required students to answer 
comprehension quesBons that require Knowing the meaning of words in several 
short passages. 

Eight of the twelve subjects (67%) in the Urge Teaching Set program and 
ten of the twelve subjects (83%) in the S.t>bI1 Teaching Set pmgram met 
„,astery criterion by the end of 11 sessions. The study was terminated after 
eteven sessions because the experimenter felt that the subjects who were sUll 
stmggnng to reach mastery were no longer benefiting from instmction. The 
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mean number of sessions to mastery (for those who reached mastery) was 7.6 
for those in the Small Teaching Set and 9.1 in the Urge Teaching Set program. 
Table 1 presents the descriptive statistics for both groups. Resutts of a 1 - test 
indicate this difference is significant (p < .05). Hence, subjects in the Small 
Teaching Set program met mastery In significantly less time. 

[Insert Table 1 about here] 
A 2 X 2 analysis of variance (ANOVA) was perfomied on posttest and 
maintenance ter.t scores, indicating no significant main effect for type of 
instruction. Results on the multiple choice test in the fixed timedesign (i.e.. the 
test administered to all students after seven sessions) indicates a slight, but 
nonsignificant difference in means favoring subjects in the Urge Teaching Set 
program. Differences between scores on two transfer measures were also 
statistically nonsignificant. 

The unequivocal finding of the study was that the subjects taught with the 
small Teaching Set program reached mastery criterion on the set of 50 words 
faster than subjects with the Urge Teaching Set program. In addition, more 
students in the Small Teaching Set program reached mastery within eleven 
lessons. Given tha» the groups achieved equivalent levels of perfomiance on 
the multiple^hoice tests, their difference in acquisition rates becomes even 
more meaningful. Subjects taught with the Small Teaching Set program 
required less time to meet mastery criterion on the words, yet their posttest 
performance was equal to that of subjects in the other treatment who took 
longer reaching mastery. In addition, the shorter instructional time which the 
Smaller Teaching Set program subjects required did not negatively affect their 
retention of word meanings. 



Tutorials 

R^flgnninq Skills: Correction Procedures 
Much of the recent literature on improving special education teaching 
practices has stressed the importance of providing academic feedback to 
students when they made errors (Camine, 1980; Riath. Polsgrov- & Semmel, 
1981 ; Stevens & Rosenshine, 1981). Furthermore, a recent meta-analysis of - 
the limited research on corrective feedback by Lysakowski and Walberg (1981) 
suggests that detailed corrective feedback is superior to merely telling students 
whether their answers are right or wrong. Just telling students they are wrong 
(called a "basic correction") does not help them solve the problem correctly. 
Tl,ese authors suggest that studenti^ need to see an overt model of all the steps 
necessary for an appropriate response. By observing a model of ail the steps 
necessary In obtaining a correct response, students receive detailed 
Information on how to solve the problem. This procedural knowledge should be 
of use when they encounter similar types of problems. This type of correction Is 
referred to as an "elaborated correction." 

This was the first of two computer tutorial studies, and the primary Intent 
here was to examine whether remedial and mildly handicapped students who 
receive elaborated correction procedures would perform significantly better 
than students provided with basic corrections (Collins, Carnlne, & Gersten, In 
press). We also examined any differences regarding acquisition time between 
students. Reasoning skills were chosen as a subject because they tend not to 
be routinely taught to special education students (Zetlln & Bilsky, 1980). 
Furthermore, the strategy used to teach these skills (i.e.. basic analysis of three 
statement arguments) was highly rule-governed, thus It Is an appropriate for an 
elaborated correction procedure. 
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Twenty-eight mildly handicapped and remedial junior high school students 
from a pool of 34 students were selected and randomly assigned to the Basic 
Correction v.r Elaborated Conrection group. Participating subjects had nt least a 
fifth grade reading leve! (but also had a reading comprehension deficiency of 
no more than three years) and passed a screening test that measured their . 
understanding of large and small classes. The Elaborated Correction group 
used an unaltered copy of the CAI program used in the study. The Basic 
Correction group used a modified version of the p.ogram.. If a student in this 
group made an error, they were only given the correct answer. This was the 
only difference between the two conditions. In both conditions, students worked 
individually on a micrxomputer. Students worked on their respective version 
of the program until they completed five lessons. 

Tho r.Ai Program. The Reasoning Skills program (Engelmann, Camine, & 
Collins, 1983) was designed to teach students tv^o major objectives: a) to draw 
conclusions from two statements of evidence and b) to determine whether a 
three-statement argument was logical or Illogical. The program taught students 
about overlapping classes and non-overlapping classes. They learned that 
there are three possible key w jrds (some. all. no); the same rule holds for all 
three. It also taught students relevant rules for constructing and analyzing 
arguments. The other major objective of the program was to teach students to 
identify unsound arguments. For logically unsound arguments, students were 
taught to specify one of three reasons why an argument was unsound. 

MpflRures. The Test of Formal Logic (Collins, 1984) was the primary 
dependent measure 'n the study. The purpose of this test was to measure a 
student's ability to construct and analyze syllogistic arguments. Two alternative 
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forms of the test were designed; Form A was used as the pretest and 
maintenance measure (given two weeks after treatment terminated) and Form B 
was used as the the posttest measure (given immediately after the treatment). 
The internal consistency reliability (coefficient alpha) for Fomi A was .90 and .91 
for Form B. Parallel fomi reliability between Forms A and B was .84. 

There was also a 15 item transfer test that evaluated subjects* abilities to 
generalize what they had learned on the computer to similar analytic tasks, but 
in prose paragraph form. The transfer test was devoted to the more difficult 
objective on the program - deciding whether arguments were sound, and. if not 
sound, giving a reason. This test was given to subjects on the day after they 
completed training on the CAI program. 
Results 

A 2 X 3 analysis of variance (ANOVA) with one between subjects factor 
(Type of Correction) and one within subjects factor (Time of Test) was 
performed on the data. This analysis Involved a planned comparison that 
looked at the post and maintenance tests only. Table 2 presents the descriptive 
statistics for the pretest, posttest, and maintenance tests. The ANOVA indicated 
a significant difference favoring the FJaborated Corrections group (c < .001^ 
There was also a significant difference between the two groups on the transfer 
test, again favoring the Elaborated Correction group (b < .05). 

[Insert Table 2 about here] 

Data were collected on the time students took to complete each of the five 
lessons. The purpose of this analysis was to see whether students In the 
Elaborated Corrections group took more time to complete the lessons. A 2 x 5 
analysis of variance (ANOVA) with repeated measures was performed on the 
time-per-lesson data and non-slgnificant difference between groups was found. 



Discussion 

This study was the first to explore experimentally the effectiveness of 
elaborated corrective feedback in teaching a complex cognitive skill to 
handicapped learners. The results indicate this is an effective instructional 
procedure. 

The roughly equivalent time for the two groups to complete the five lessons 
seems anomalous at first. With more text to read in elaborated conrections. that 
treatment would seemingly take longer to complete the lessons. Completion 
t'-ies were not significantly greater for the elaborated corrections group, 
however. The extra time required to read the elaborated corrections may have 
been compensated for by faster acquisition of the material. In both versions of 
the program, the computer would return a student to items that were missed 
earlier in a lesson. If elaborated corrections resulted in fewer mistakes, 
students would spend less time returning to missed items. This interpretation 
suggests that taking more time early in a complex instructional sequence to 
offer elaborated corrections may. in fact, lead to savings in instructional time 
later in a program. 

Both the basic and elaborated correction groups improved their reasoning 
skills as measured by the dependent variable. Tha groups demonstrated a 
mean score of 68 - 70% on the posttest (a dramatic gain from the mean scores 
of 26 to 34 percent on the pretest). The systematic design of instruction - 
particularly through a series of carefully controlled mies - may have contributed 
to this gain. Reasoning skills were acquired without any instmction from the 
teacher. Typically. CAI programs merely provide drill and practice exercises to 
supplement teacher instruction. Here the program was a true tutorial and did ai 
the instructing. 



M pth Wnrri Prnhlems: Pypltrit Strategies 
In our second study of tutorials, we examined the effectiveness of this kind 
of CAI program in teaching math word problems. Unlike the basic analysis of a 
three statement argument, solving word problems is a much more complex skill. 
Students are required to make many discriminations and success depends, to a 
large degree, on linguistic analysis (Jerman & Mlrman, 1974). Although a large 
number of studies have b(3en conducted in math problem solving, few have 
yielded any adequate infomiation for building effective interventions because of 
flaws in research design (Wlpatrick, 1978; Sllbert, Camine, & Stein, 1981), and 
varying definitions of problem solving and the tasks to measure problem-solving 
ability (Silver & Thompson, 1984). Furthermore, the success of future problem 
solving research depends on less on a continued analysis of the learner and 
his or her deficiencies and more on 1) an analysis of the limits of Instmction the 
students are currently receiving and 2) development of strategies that will work 
with low achieving students. 

The specific purpose of the study was to determine whether handicapped 
students could learn to solve multiplication and division math story problems if 
taught a strategy that focused on how to choose the con-ect operation (Gleason. 
1985). It was hypothesized that students who received explicit instruction on 
choosing the operation wouW solve more problems correctly than students who 
received instruction that did not specifically focus on the choice of operation and 
concentrated, instead, on a more general strategy of manipulating units. A 
wider Interest of this study, one directly related to our systematic research 
agenda, was In whether or not a CAI tutorial alone would be effective in 
teaching a complex cognitive skill. 



Mftthod 

A pretest-posttest design with random assignment of subjects to treatment 
groups was used to examine the effectiveness of two procedures for teaching 
mildly handicapped students to solve math word problems. Twenty-six junior 
high school students from two different schools were randomly assigned to 
either a "direct Instruction" math story problems program or a highly regarded, 
commercial program. Each student worked at a computer for 15 to 30 minutes a 
day for 1 1 days. 

Th P HAl Programs. The direct instruction program Analyzing Word 
Problems (Gamine. Hall. & Hall. 1983) Is based on principles of a theory of 
instruction described by Engelmann and Gamine (1982). The approach 
requires direct teaching of a clearly-specified, step-bystep strategy. When 
instructing the students, the teacher models each step in the process, heavily 
prompting the students as they continue to use the process. The prompts are 
systematically faded until students reach independence. When students make 
errors, the teacher again models or provides a prompt based on a previously 
taught rule. This kind of strategy Instruction is Incorporated Into the Analyzing 
Word Problems program. The program teachas students how to solve 
multiplication and division word problems In a step-by-step fashion. When 
students err. they are given a rule-based correction (e.g.. "Does the statement 
contain the word Each or Every? If so. what kind of problem is itr). 

The Semantic Galculator (Sunburst Gommunlcatlons. 1983) was used as 
a contrast to the Dl program. This program is based on the premise that the 
major difficulty In solving word problems comes from inappropriate 
manipulation of units (e.g.. weeks, apples, dollars, etc.). If a studert could be 
taught to extract from the problem the quantities needed to solve the problem 
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and the correct units for the answer, they would bo able to solve the problem. In 
this program, a student is guided through story problems by answering "How 
many?" and "Whatr^ questions about worri problems that are written on 
woricshoets. Next, the student uses the letters A and B to type in the operation 
that should be perfomied on the numbers (i.e.. "A^" o divide and "A x B" to 
multiply) and then predict the units in the answer. The computer shows the - 
student what units were used to express the answer to the problem. If the 
student answer did not match that of the computer's, the student knew that he or 
she should go back and try again. 

Measures. Both the pretest and posttest were 28.item tests comprised of 
11 multiplication. 10 division. 2 addition, and 5 subtraction problems. All items 
were selected from three major arithmetic intermediate level textbooks and from 
the California Achievement Test. Sixty^ight percent of the problems on the test 
were like ones Included in the instmctional lessons; the remaining 32% were 
transfer problems. 

Results indicated no significant differences between performance of the Direct 
instruction group and the Semantic Calculator group on the posttest and in the 
amount of time used to take the posttest Intsrviews with students as they 
performed problems (i.e.. choosing the correct operation and telling a reason for 
the choice) did yield a statistically significant difference between the groups 
favoring the Direct Instmction group, but the mean perfomiance levels for both 
were not considered educationally significant. Table 3 shows the descriptive 
statistics for the two groups on the pre- and posttests. 

(Insert Table 3 about here] 
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Disftiission 

There are many possible reasons for why there were no significant 
differences between groups. Eleven days at 25 minutes a day may have ^>een 
too short of an Intervention. With a longer treatment period, it would have been 
more certain that an unacceptable level of performance was attributable to other 
factors. Further, observations of student performance during the siudy indicated 
that many students typically ignored prompts on i^e screen that told them what 
to do next. Hence, through a failure to attend the students may well have 
missed opportunities to learn from their enrors. 

It is also conceivable thai mildly handicapped students may need more 
teacher-directed instruction before using a computer fcr additional practice 
opportunities. The presentation of the problem-solving strategy on the 
computer lacked the subtlety and flexibility that a teacher adds to instruction. 
Good teachers gather a considerable amount of information about how students 
are learning a new sidll - particularly one as difficult as problem solving - ar " 
modify their teaching accordingly. Observations of teacher directed instruction 
that uses the same strategy that is presented in the Analyzing Word Problems 
program support this hypothesis (Gleason, 1985). Therefore, the most 
appropriate use of a computer for students such as these may be for guided 
practice (i.e.. as a medium for reviewing material that is already familiar to the 
students). 

«^imtiiations 
Hoj=ifth Wayfi' Problpm Rnlx/inn Skills 
Secondary students spend a considerable amoum of their time completing 
seatworit activities (Doyle. 1983). These academic tasks often involve higher 
order cognitive skills, and students are asked to make a variety of inferences 



about a subject area by pruderrtly using facts, concepts, and strategies or 
problem solving skills. Some writers (Doob. 1972: GreenblatA Duke. 1975; 
Budoff. Thormann. & Gras. 1984) have suggested that one way to enhance the 
higher order skills of students is through educational simulations. 

While much of the research has concluded that computer and non- 
computer simulations are no more effective than conventional instruction, many 
of these studies have been plagued by fundamental weaknesses in research 
design (Fletcher. 1971 ; Pierfy. 1977). In the study below, we addressed many 
of these problems and created an Instrument that reflected the problem solving 
skills actually taught in the computer simulation. Rnally. we have addressed a 
curious feature of previous simulation research: the general reluctance to 
combine simulation instruction with conventional Instruction. 

Only on a few occasions have simulations and conventional instruction 
been compared to conventional Instwclion alone. Nor is It clear In most of 
these studies what constitutes conventional Instruction. O. of our Interests in 
studying simulations was to investigate how effective instfuciicnal practices 
couW be used to enhance - rather than replace - seconu^ry level Instruction, not 
only in terms of their effect on basic fact and concept retention, but as they relate 
to higher order skills (Woodward. Gamine. & Gersten. 1986). 
Method 

Thirty mildly handicapped high school studen:s from a pool of 38 potential 
students were randomly assigned to either the conventional or simulation 
condition. Students were selected for the study on the basis of Metropolrtan 
Achievement Test (MAT) reading comprehension scores. Only those who had. 
at a minimum, a sixth grade reading level or were at least two years below 
grade level in comprehension were selected for the study. 



All Students were Instmcted tor 40 minutes per day tor twelve days. The 
lesson consisted of two parts. The first part, called stnifitl i mf l '-^hing . was 
identical tor subjects in both student conditions. Instruction was conducted in a 
large group of 12 to 15 students tor this part of each lesson. 

At the end of the initial Instruction, students separated into two groups - 
one which worked on application activities (the conventional group) and the - 
other with the computer simulation (the simulation group). The conventional 
group worked in the resource room under the supen/ision of the resource room 
teacher, who presented these students with a variev; of application or review 
activities. 

Simuiatlon students, on the other hand, were taujjht in a computer lab. 
each student working individually at a microcomputer. The twelve day course of 
instruction tor these students was broken into three phases: initial modeling 
(three days), guided practice on three simulation games (two days), and 
independent practice with individua' feedback from the instructor (seven days). 

Th ft r.A\ Program. H«*a'^>' Wavs is a commercial software program 
developed by Camine. Ung. and Wong tor the Apple II and IBM PCjr 
computers. The simulation provkles extensive practice on analyzing health 
profiles. An initial screen showing profile characteristtos can be seen in Rgure 
2. The H^^Hh Wavs sun rlftmftr^''T an accompanying written 

curriculum devetoped by Woodward and Gumey, extended intormation 
presented in HoaHh Wavs and the original bflaimjfiiajfi teachers guide. 
Matorial was taken directly from two widely used junior high school health 
textbooks. All of the Intormation was rewritten to control tor vocabulary and 
amount of new intormation. CHppings from newspapers, news magazines, 
journal articles, and health pamphlets were also used in this supplementary 
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curriculum. The reading 'svel of the curriculum is approximately sixth grade. 

(Insert Figure 2 about here) 
Maasuras. students were assessed one day. two days, and two weeks 
toliowing the inst.iiCtion. On the first day. studenfs acquisition of basto facs and 
concepts about heaHh taught In the curriculum was measured by the Health 
ways Nutrition and Disease Test. The first 20 questions of this test were solely 
from the written curriculum. The remaining 10 ^re questions over material that 
appears in both the written curriculum and the ^)tim simulation. This 
latter section will be referred to as the t^inlQlCfill section of the test because 
infom^ation pertaining to the items was reviewed in the computer simulation, 
internal consistency reliabiiity (coefficient alpha) of this measure is .84. On the 
second day. the students were given the Health Ways Diagnosis Test, an 
individually administered test measuring prioritizing skills. This test was a set of 
three written profiles and measured health related problem <.olving skills (i.e.. 
the student's ability to detect important health problems facing an individual, 
identify and change related health habits, and control stress as it increased cue 
to tne health changes). The Hearth Ways Diagnosis Test has a test - retest 
reliability of .81. Two weeks after the Instmction the students were again given 
the Health Ways Nutritior. i.nd Disease Test. This served as a retention 

measure. 

pesults 

^„^H„.y,,,.^,.H..onp.Hni..«..T«st. This tost was usad in both tt,e 
posttest and maintenance (retention) phases, a 2 x 2 analysis of variance 
(ANOVA) was used. The 30 items in the test we« broken into two subseales: a) 
items reinforced by the baaUHJfiiajffl simulation and b) items taught i.i the 
curriculu-T. and not reinforced by the simulation. Separate 2 x 2 ANOVAs with 



ERIC 



18 



seated measu^swereperto^nedon each subsca.. The e«ea on -.ems 

^viewed or reinforced by HflallllJCfiaS*^^^^^^^ <^< '>'^^"' 
™ns^n>«can.,or.hose.en.no.rein,orceda.<.0e).Tab.4presen^^^^ 

*scHp.«es.a«s.>cs.orrein,orcedsec«ono.*e.e«.Tnis.n<.ca«s^at.^^^^ 

3«a«on was an e«ec«ve veh^e ,or revie^n, n,a.eHa. ma. had .ready been 

taught in the written curriculum. 

[Insert Table 4 about here] 
,„.,^„,,,,^r>,..n...Test. Hests per*=m,ed on .his .es. demons.ra.e a 
signiflcan. di«erence be.ween «,e «o groups «> < .001 ) in prob.n, soMn, 
Sin,u.a«on s.uden.s were be«er able « diagnose heai.h probien,s^ 
priort«ze «.en, as .0 tt,eir eHeCs on an individuaTs longevi.y. and prescnbe 
appropha.e™medies. Mhough nK>s. simu^ion sn.den« <«d no. recede 
; ,c.scoreson.hisn,e.ureC..e...hey.o.^.^s.-egypresen.ed.n. ^ 
Ln,en.e,ao..).por.onnan=ewas.mos.a^ysabove.hecn.enase.bv«,.e 

experin^nlers tor appropHa.e diagnosis and ,he suggested sequence 0. 
remedies. 

The resuKs 0. .his sfcidy suppon *e use 0. computer simuiatons ,n 
,eachingn,a.eHa,no.ea...augh.by.rad«iona.n,eans. Fur.her. a ^.ctured 
approach in simu^«ons. one where ou.comes are sped«ed and con.roiied. 

does p.c3uce significan. educa«onal resulte. 

we infer .«>m.heresu«s.ha..heex,id.s.ra«gyins.ruc.on used. . each 

„e simu.a«on s.uden.s abou, was a success*,. bHdge « .ess 

.reCins.-uc«onac«v«es. Kisno.c.ear,ha.ccn,pu.ersimuia.K)ns.by 
.en,seives.areadepua.ein.eachingh,heror*rs.l..For..sreaso.^^^ 
™igK.specu^.e.ha.addedins.n^onisnecessa,y:a..eas..o.hepo,n..ha.« 



focuses the student on the instructional goals of the simulation. 

The results of these four studies Indicate that properly designed CAI can 
b-? effective as an instructional medium. These findings are consistent with our 
non-computer research that we have conducted over the last ten years. 
Sophisticated design principles can make a considerable difference in the - 
effectiveness of any instructional program. Another outcome implicit in these 
studies Is that they begin to identify - with much greater clarity - the role of the 
teacher and his/her instruction away from the computer. This perspective 
deviates from original questions about computer assisted instruction (e.g.. Is 
CAI more effective or efficient than conventional instruction?). It forces us to 
look closely at the intersection of the te?cher. the academic task, and the stage 
of instruction (e.g.. Is the skill just being introduced, practiced, or reviewed?). 

The Vocabulary Instmction study, for example, demonstrates that a skill 
requiring considerable practice can be adequately taught on a computer. Such 
a task is time consuming for a teacher and can be handled effectively by CAI. 
Furthemiore. there is very little variation as instrucUon moves from one stage of 
instruction to the next (i.e.. from introduction and modeling to guided practice to 
independent practice). Note, however, that the task, as it was defined in the 
study, was one of memorizing vocabulary words. We did not teach nor assume 
that students would necessarily learn how to use the words expressively or 
detect their meaning from context. This would have required a different 
analysis. 

The Reasoning Skills program, a teacher independent tutorial, was 
successful at teaching a more complicated academic task: logical inferences. It 
might be argued that the particular academic skills taught in this program were 



more discrete than. say. math word problems or the subtle problem solving 
skills addressed in the Health Ways study. If this observation is correct, then 
computer assisted instruction - carefully developed with instructional design 
principles and field tested - can be an effective, "stand alone" iom of instruction. 
As with the vocabulary program, such instruction is efficient insofar as it allows 
the teacher to attend to other instructional activitie. while students work on the 
computer. 

Math word problems, on the other hand, do no share the same task 
complexity as the syllogisms. In this study, we speculate the best way to teach 
skills such as math word problems, notoriously difficult for mildly handicapped 
students, may be through teacher directed instruction first, with the computer 
used for guided practice. This conclusion is admittedly tentative, and we will 
conduct another study after we revise our strategy. 

Fmally. the Health Ways study strongly suggests that facts and concepts, 
which were preskills to the problem solving activities, can be efficiently taught in 
group instruction without computers. The computer can be an effective tool 
after the preskills have been introduced and explicit strategies for using the 
simulation have been taught by the teacher. In this sense, a complex task like 
problem solving can be effectively taught in the guided and independent 
practice phases of Instruction. 

As a program of research, these studies dr-)onstrate that empirically- 
based instructional design principles are applicable across different types of 
CAI programs. Although the research focuses on these principles, we can 
extract some recommendations about teacher practices. With effective softwara. 
the teacher can use CAI programs for the purpose of efficiency (i.e.. letting the 
computer teach low level or time consuming tasks) or to present material that 



would bo difficult to do through traditional media. The latter was certainly the 
case with the Health Ways simulation. Looking beyond the specific outcomes of 
this research, an optimal use of computer assisted Instmction requires more 
than a set of design principles. It entails a careful consideration of the academic 
taslc. the stage of Instruction, and the role of the teacher. 
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Table 1 



Mean Score. Standard Deviation, and Mean Percent Correct on SO-ltem Prete^, 
SsTand Maintenance Test for Small Teaching Set and large Teaching Set 

Samples: Tim^to-Mastery Design 



Pretest 



Mean Mean 
Percent Percent 
H M ^ Correct M SD. iMsA 



Group 

Teaching Set 12 24.7 8.1 49.4 42.0 4.0 84.0 



f^aintftnanee Test 

Mean 
Percent 
^ ^ Correct 



40.5 7.1 81 .0 



Large 



Teaching Set 133 24.9 7.8 49.8 43.7 7.7 87.4 42.0 8.7 84.0 



aone subject completed only seven sessions. This subject's posttest scores appear 
only in the fixed-time design. 



Table 2 



Mmh Scora Standard Deviation, and Mean Percent Correct on Test Formal 
Logic^pSe!r?o^e^^lS Test for the Elaborated and Bas.c Correction 

Groups 

Pretest Pflsttest MnlntftHance Test 

Mean Mean Mean 

Percent Percent Percent 

ciSns 17 10.8 4.3 30.7 24.6 5.0 70.4 25.1 3.7 71.6 



C^1«ions 17 11.5 4.2 32.9 21.5 6.6 61.1 20.6 7.5 59.0 



Tabled 



Maan Score Standard Deviation, and Mean Percent Correct on the 2Wtem Story 
^Sbtem rSt: P?S tor the Analyzing Word Problems and Semantic 

Calculator Groups 



pffttest 



Analyzing 

Word Problems 13 



Mean 
Percent 

13.5 3.7 48.0 



Mean 
Percent 



14.8 5.5 



53.0 



Semantic 
Calculator 



13 13.9 3.5 50.0 



11.6 6.3 41.0 



oo 



Tabl«4 

the ufiauLXXflsa wum simutetton Groups 



proup 



Mean 
Pft.'oent 



^alntarmnettTflSt 

Mean 

Percent 
^ Qaaasi 



Simulation 15 7.3 1.4 73.3 
Conventional 15 5.6 2.2 56.0 



6.5 
5.0 



2.0 
2.2 



65.3 
50.0 



Rgura 1 



In^jfriif^lnnfll Dftslon Variflblas 

Example Review CorrecUon ExpOdt 
Set Size fi^^dfiS Procedures Strategies 

Studies 

Vocabulary Instruction 
Math Word Problems 
Reasoning Skills 
Health Problem Solving 



X 
X 



X 
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Effects of Instructional Design Variables on 
Vocabulary Acquisition of LD Students: 
A Study of Computer-Assisted Instruction 

Gary Johnson, Russell Gersten, and Douglas Gamine 



tioehwgZlpraaicitxirtisaonsmattsm^ 
anwordsikesn^liomediniheprci^ 

se Jnvttm ^ed mastery wiMn 11 Mssions than mdmiu ^ ' «r 
^rt^r&ir in both pfopam faimaf ax ^(^^^^^ 

Yeone^pliomidthematmalmoft^l^ 
dimniotiantmtrm4€rmiasum.anofd 

eompr^ension test 



Based on the premise that word 
knowledge correlates highly with 
leading comprehension (AnderMn 9l 
Freebody. 1981; Merynsli 1983; Pear- 
sen & GaUasher. 1983; StahU 1983; 
Ticmcy ft Cuaalngham, 1984)» sever- 
al investigators have attempted to im- 
prove smdents' leading c6mprchcn* 
ston skilU by teachuf vocabulary. In 
only a few of these ^dies were sub- 
jects identified as either low perform* 
ing or learning disabled. 

While lU smdics produced evi- 
dcncc of improved vocabulary knowl- 
edge, the effects on stadtng com- 
prehension have been varied and 
inconclusive. However, the methods 
which were most successful in teach- 
ing new vocabulary consumed the 
most instructional time, and even 
then the gains, in terms of numbers of 
^rds learned, were modest An area 
in net u research is the identifica- 
tion of :e:liods for increasing the 
direct instruction of vocabulary for 
low performing smdents. An impor- 
tant concern, particularly with mildly 
handicapped smdents. is increasing 



instnictional time without increasing 
the demands on teachers* alteady Urn* 
ited time. One potential vebkle for 
delivering this instniction U compu- 
ter-as^sted vocabulary instruction. 

INSTRUCTIONAL DESIGN 
FOR COMPUTER* 
ASSISTED INSTRUCTION 



Features of computer^tssisted in- 
struction (CAI) trtiich arc seen as ad- 
vantageous for instmction with spe- 
cial education smdentt include 
individualization and Klf*padng. im- 
mediate feedback about performance, 
consistent correction procedures, pa- 
tient lepetition, carefully sequenced 
instmction. freqtient student respond- 
ing, and motivation (Budoft Thor- 
mann. & Gras. 1984). Yet much cur- 
rently existing software fails to provide 
these feamres in programs for special 
education students (Thormann. Ger- 
stcn. Moore. & Morvant in press). Twc 
instructional design considerations 
that were explored »n this smdy were 
(a) optimal set sii,. for introducing 
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miw^nUand(b) schedules of re- 
Relevant rejearch on each vatiai 
discussed below. 

"Set Size** for Daily Lmsw! : 

Drill and pnctice programs ai 
most widely used CAI software 
doff et aL 1984). but litde attentio 
been directed to increasing th 
fidencyandeflicacyof^ dril . 
prac .;* strategies (Merrill ft ! 
buiy. 19S4: Sief^l * Misselt 19S4 . 
area of needed teseaidi U the 1 
mination of the appropriate size t 
teaming set for lessons in a dril r 

practice program. Conccms for . 
ricttlumdesignersarehowmany - 

can reasonably be presented d- , 
each lessonandhowthcsctsiiea 
the learner's ability » master 
lemember materiaL TbU stud> 
dresses that issue by evahutini 
software programs wiA a vet} 
fetent approach towards introdu 
set size. 

The earty behavioral smdics (L ' 
& Hulsc. 1967: Furukawa. 1970: 
1971:McGeoch Alrion. l952)on 
mmmed instruction lend suppt 
MiUer's (1556) 'magical nunbe 
seven as the optimal set size (i.e 
average number of ttwclated c : 
letters, or words that an adult 
lecaU after one ejtposuie). Bet : 
most of the research in this are:. . 
been conducted with coUege level 
jects(ci:Deese&Hulse,1967: 
1971: McGeoch&Irion. 1952), an 
results of snipes with scbool*age 
dren were equivocal further rest 
with sehool*age children «dt. lear 
diubilities is needed. One purpc 
the present $«udy was to compar 
effects of different set sues on th 
quisition of word meanings by ir 
handicapped students. . 



Review Schedule 

Ihe element of review was ai 
portant variable in the Beck. ?ti 
and McKeown (1982) study of vi 
uiaty acquisition. Words were pre 
ed in two different levels of frequ. 
In one freatroer.c. each set of » 
was taught vx a 5^y cyclfc Ai 
words in the set were introduce 
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the fint day of the cycle and reviewed 
whf n necessaiy. but only during that 
5^y cycle. Theie was no subsequent 
teview. In a second treatroen words 
weie introduced in the same manner, 
but ad^tional practice was provided 
after the fifth day in special review 
cydss. Words in thU treatment ap- 
peared 16 to 22 additmal dma each. 

The effects of the extra reviews were 
dear. Smdents remembered more of 
the words which received special re- 
views, and on a 'exical decision test, 
smdentt were able to access the defim- 
$on of these words more quickly. A 
replication of the original study by 
McKeown, Beck. Omanson. and Per- 
fetd (1983) again demonstrated that 
extra review enhanced smdents' abil- 
ity answer multiple choice ques- 
tions about story passages. In the repli- 
cation, the extra reviews positively 
affected both smdents' rertntion of 
word meanings and their comprehen- 
sion of passages containing instructed 
words. 

Research in computer-assisted m- 
strucnon has demonstrated that sev- 
eral short-spaced reviews are more ef- 
fective in increasing retention than are 
a few massed reviews (Gay. 1983; 
Siegel Sc. Misselt. 198*). 

Merrill and Salisbury (1984) pro- 
pose a strategy thst would provide 
spaced reviews during a CAI drill and 
practice program. New items are pre- 
seated to smdents. and only items they 
do not know become part of a "work- 
ing pool" Tbe number of items in the 
working pool would be determined 
" empirically (which is one purpose of 
the present smdy). Once the smdcnt 
meets a specified criterion on an i'> 
in the working pool, that item is re- 
moved and placed in a "review pool" 
Each item in the review poo', is re- 
vievred on specified dates for a speci- 
fied number of times. 

Description of the Software 
Programs Used In the Sti.dy 

Two programs were used in the 
smdy. The comparison program, 
called the Large Teaching Set, was a 
_ commercial program that used larger 
teaching sct« and did not provide 
^- mulative review. The experimental 
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program, the Small Teaching Set was 
intended to exemplify the two prin- 
ciples of instrocticnal design dis- 
cussed above-^timal set size and 
cumulative review. 

The distinctive instructional design 
feamresof the Small Teaching Set pro- 
gram include (a) individualixpd les- 
sons which provide teaching and prac- 
tice only on words the smdent does not 
Vcnow. (b) a practice set which consists 
of no more than seven words at any 
time, (c) a specified mastery cntenon 
which must be met two cons«amve 
lessons before a word is considered 
learned, and (d) cumulative reviews on 
learned words to ensure retention. 

The iwview procedures of the two 
programs are different The Small 
Teaching Set program provides daUy 
teview on words in the swdent s prac- 
tice set and periodic cumulative review 
of vwrds whicl. the smdent has 
learned in the program. Once the sw- 
dent hM mastered 10 words, the pro- 
gram prcsentt a cumulative review 
lesson on those words. In contrast the 
Urge Teaching Set keeps no cumula- 
tive record of swdent errors, so no 
cumulative teview is provided. 
TheUrgeTeachingSetwas adapted 

from a commercial program devel- 
oped by Davidson and Eckett (1983). 
Words are ttught in sets of 25. The stu- 
dent may choose to see the words in 
any of four types of formatr. (a) a 
teaching dispUy which shows the 
word, its definition, and one example 
sentence; (b) a multiple choi« quiz 
format; (c) an exercise in which a 
definition U displayed and the smdent 
must spell in the correct missing word 
to complete a sentence: and (d) an 
arcade-type game in which the smdent 
matches words to their definitions. 

In summary, then, the purpose of 
this smdy was to compare two meth- 
ods of computer-assisted instruction 
for teaching vocabulary to mildly 
handicapped adolescents. The smdy 
examined the effect of various cycles 
of practice and review on the acquisi- 
tion and maintenance of word mean- 
ings. Two CAI vocabulary programs 
vwre used to present the same 50 words 
and definitions. ;» .s size of the daily 
teaching sets and previsions for daily 
and cumulative review varied between 



lilt two programs. Acquisition was 
assessed by multiple choice measures. 
In addition, smdents were asked to 
define 10 of the words and answer 
written comprehension questions on 
passages containing 10 cf the most fre- 
quently missed words. A maintenance 
ftdministeftd 2 weeks after in- 



test wt$ tdministeted 2 weeks after in- 
struction ended 

METHOD 



Subjects and Setting 

Tbiity^ght high school smdents 
with learning diubilities in grades 9 
thfOtt^ 12 were considered eligible 
for the study. Each of these smdents 
scored at least 3 years l>clow grade 
level on the Reading subtest of the 
Woodcock-Johnson (Woodcock & 
Johnson, 1977). Because the Wood- 
cock-Johnson had been administered 
at different times to different smdents 
by the district the experimenters 
administered the Advanced I Level 
Reading subtest of the Metropolian 
Achievement Test (Prcscott Balow, 
Hogen, & Farr. 1978) to all smdents. 
The test was administered to the entire 
group 3 weeks after the conclusion of 
the smdy. 

The mean performance of the entire 
sample corresponded to the eighth 
percentUe. Scores ranged from the 1st 
to 22nd percentile. In contrast tSe 
mean score for the district was at ap- 
proximately the 72nd percentile. Thus, 
all students performed significantly 
below their peers on sundardized 
measures of reading achievement 

All smdents were administered a 
multiple choice, 50-ium vocabulary 
pattest They were aien matched by 
pietes; scores and randomly assigned 
to one of the two treatments, the Small 
Teaching Set program cr the Urge 
Teaching Set program. Six sradents 
who scored over correct on the 
pretest were excluded from the smdy. 
Two smdents decided not to par- 
ticipate. During the smdy, four sm- 
dents were dropped due to frequent 
absences, aud one smdent was 
dropped when his performance in- 
dicated that his pretest score was inac- 
curate. Thus, a total of 25 subjccu ac- 
tually participated in the smdy. 
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The study was conducted in a large, 
special education resource classroom 
in a high school in the northwest. IBM 
computers and color monitors were set 
up in the back of the classroonu away 
from other instructional groups. 

Materials 
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The Small Teaching Set program 
(Caminc. Rankin. & Granzin, 1984) 
constructs individualized CAI vocab* 
ulary lessons by first testing a student 
on new words and then composing 
teaching and practice sets of only 
those words which the student does 
not know. An example of a teaching 
frame appears in Table L 

The exercises in the practice set con- 
sist of three types of multiple choice 
items: the new word appearing alone 
with the correct definit: as one of 
five choices, the word appearing in a 
sentence with the correct definition as 
one of five choices, and a synonym (or 
short definition) for the word appear- 
ing in a sentence with the word as one 
of five choices. Examples of practice 
exercises appear in Table 1 For the 
practice exercises, the program picks 
from a pool of four items and ran- 
domly selects items to present. The 
student must get two items per word 
correct before the lesson ends, unless 
time runs out and the student selects 
the **cscape'' option to terminate the 
lesson. 

Ir. order to reach mastery criterion 
on a word and have it removed from 
the practice set, the student must iden- 
tify the word's meaning two consecu- 
tive times on two consecutive lessons, 
or, in other words, four times in a row 
across two lessons. The word then 
becomes a '*leamed'' word and moves 
from the practice set to the "review 
set." Once 10 words have been 
^•leamed" and moved to the review set, 
the program provides a cumulative 
review test on the review set Any 
words missed on this cumulative 
review test are put back in the practice 
set, and the student must again meet 
mastery criterion on '* ^ word. 

The Large Teachiu^ Set program 
(Davidson & Eckert, 1983) teaches 
words in sets of 25. The program 
comes with nine levels of 75 words 
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each for grades 4 through 11 However, 
for the purposes of this study, the same 
50 words used in the Small Teaching 
Set program appeared in the Large 
Teaching Set program as two sets of 25 
words (sec "Word Selection* below). 

Each time the program is run, the 
student goes through the same 25 
words in the same order. Unlike the 
small set programs, some of the words 
are words the student already knows, 
since there is no individualization. At 
the beginning of the lesson, the pro- 
gram presents a menu with a choice of 
four formats: "word display," "multi- 
ple choice quiz," ^sentence compl::- 
tion,* or an arcade-type game. 

These activities include two word 
display and multiple choice quizzes 
similar to the Small Teaching Set pro- 
gram, and two that are quite diflerent. 
Sentence completion involves spelling 
the new words, and the arcade activity 
involves matching exercises in a game 
format. (For details, sec Johnson, 1985, 
pp. 29-34,44-51.) 



had yet to meet mastery C*currer 
reviewing") and words mastered (* 
ready learned"). 

Sdectioa of Wonb for the Stu 
The Large Teaching Set program p 
vides words, definitions, and exerci 
for 25 nouns, 25 verbs, and 25 adj- 
tivcs for each level. Priorto the studj 
list of these 450 words was given to 
middle school and high school edui 
tion teachers in the district in whi 
the study was to be conducted Thi 
tcachen picked words from this 1 
which they considered important a 
useful for mildly handicapped Si 
ondary special education students 
know. An initial list was constructed 
107 words which were considered ii 
portant by two or more special educ 
tion teachers. A final list of 25 ver 
and 25 adjectives was developed t 
use in the study. All of these wor 
were from the words commonly co 
ered in grades, 7, 8, and 9. 



FeedbM€ktoStadeots. Both CAI pro- 
grams provided comparable immedi- 
ate feedback to subjects on the ac- 
curacy of their responses. In the Large 
Teaching Set program, when the sub- 
ject answered an item correctly, 
a message sucL as "^ice going* or 
'*Kcep it up, (name)," appeared. When 
the subject answered an item correctly 
in the Small Teaching Set program, 
the message, Tes, the answer is 
appeared 
The arcade-type game provided a 
type of reinforcement not available in 
the Small Teaching Set program. 
When the subject accurately **shot" the 
correct atiswer, the answer was mo- 
mentarily highlighted, and a score for 
that shot appeared briefly in the mid- 
dle of the screen. Accompanying 
sound eficcts were turned ofT in order 
not to distract other smdents and 
teachers in the room. 

Both programs also provided feed- 
back on the number of words correct 
The Large Teaching Set program did 
this by giving the subject a percentage 
correct score at the end of an activity 
and then displaying any words 
missed. The Small Teaching Set pro- 
gram listed words on which the subject 
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Tabtel 

A TMcning Prvm frotn ITm SmaU 
TMChifig S«t Program 

Th# wore HSTABUSH mttns SET UP. 

Susit win SSTABUSH a new procadura «c 
our maatings. 

Susia wiil SET UP a naw procadura for our 
maatin^. 

ThabanKisgoirgtoSETUPtnawbraftcftor ■ 
tha omar stOf of town. 
Tha bank i ♦ yotng to ESTABLISH a nav- 
branch on ha oihar SKja ot town. 



Tabia2 
TWO Practioa Forma from tha 
Small Taaching $at Program 

Thay ara working to ESTABLISH an 
organization to protact whales. 

1. maka legal 

2. atect 

3. funo 
^. se: UP 
5. jOih 

Tha doctors ara going to SET UP a new aye 
care clmic 

1. employ 

2. attend 

3. operate 

4. cancel 

5. establish 
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The same 50 words. 25 adjectives, 
and 25 verbs were entered and used in 
both the Small Teaching Set and Large 
Teaching Set programs. The same def- 
initions iwerc used in both the Small 
Teaching Set and Large Teaching Set 
programs. For the purposes of the 
smdy. exercises written for use in the 
Small Teaching Set program were the 
same or very similar to items which 
appeared in the Large Teaching Set 
program. The differences between the 
effects of the programs, if any. were in* 
tended to be a function of instruc- 
tional design features. 

Procedures 

Following pretesting (sec "Mea- 
surest below), all subjects received 
computer-assisted vocabulary instruc- 
tion during a 20-minute session each 
Monday through Thursday. Since the 
45-niinuie periods were divided into 
two separate sessions, some subjects 
began the period with a computer ses- 
sion and then returned to their regular 
instructional group, while other sub- 
jects first attended their instructional 
group and then completed a session 
ou the computer* 

The experimenter for the swdy was 
a doctoral smdeni in special education 
/ at the University cf Oregon. The ex- 
perimenter was present for each ses- 
sion to ensure that the sessions lasted 
exactly 20 minutes, that subjects ac- 
tively worked on the computer with 
minimal talking, that they completed 
as many lesson activities as possible 
during the 20 minutes, and that they 
took the optional reviews of missed 
wrds at the end of the "multiple 
choice quiz* exercises in the Large 
Teaching Set program- The experi- 
menter also completed checklists on 
each subject's daily progress. 

FmiliMriutioo with the Computer 
MJtd Word ReMdittg Pnctice. During 
the first 5 minutes of sessions 1 arid 2, 
the experimenter taught the subjects 
how to load the program disks and 
start up their programs. Most subjects 
had little, if any. experience operating 
a computer. During the next 5 minutes 
of the first two sessions the '"xperi- 
O tenter provided word ixading pnic- 
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net on words that were to appear in 
the program- The words were printed 
in short columns on two practice 
sheets, and subjects took nims reading 
columns out loud If a subject misread 
a word, the experimenter told the sub- 
ject the word and directed the subject 
to read the word and reread the 
column from the beginning. 

Eight subjects, four in each treat- 
ment, displayed difficulty in ac- 
curately decoding and pmnoundng 
words during word reading practice. 
Two additional word practice sessions 
were provided to these sntdents. 

Length of Sesshtts. The time sched- 
uled for each daily computer session 
was exactly 20 minutes. The number of 
lessons that the subjects completed 
varied from o le to two lessons. If the 
subject was in the middle of a lesson 
when the 20-minute session was about 
to end. the experimenter directed the 
subject to use the '^escape" option to 
terminate the lesson on dme. If the **es- 
cape* cption was selected, no credit 
toward mastery was counteu^ for words 
pracnced during the abortei lesson. 



MMSteryCriterioaforLMTgeTeMdiiat 
Set The experimenter told the sub- 
jects in the Large Teaching Set pro- 
gram that their goal was to get a score 
of 84% correct (21 correct out of 25). 
They were told that If they scored 84% 
or higher, they could play th^: arcade- 
type game, and if they scored 84% or 
higher on two consecutive days, they 
would move on to a new activity. 

On each day that ±t subject scored 
84% correct or more, the subject com- 
pleted the reviews and then spent the 
remainder of the session playing the 
arcade game, u. ually no more than 
twice in the time remaining. After 
meeting the criterion of 2 days at 84% 
correct or more on the multiple choice 
quiz, the subject was told to select the 
sentence completion activity; this was 
done only once, without review. 

Aft^r meeting criterion on the multi- 
ple choict quiz and doing the sentence 
completion activity one time, the sub- 
ject began the second list of 25 words, 
the same 25 verbs which were words 26 
through 50 in the Small Teaching 
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program. The subject followed the 
same sequence of activities for this 
second list of words. Once the subject 
completed all activities on both word 
sets, the subject was considered to 
have met mastery criterion for the 
smdy. 



Mcf^ures 

Pretest, PoMest, tutd MMmtetiMace 
Test A 50-item, mulHplc choice test 
requiring the student u> select the cor- 
rect definition of a word was devel- 
oped for use in this sfudy. Items were 
similar to what appeared in the prac- 
tice frames (see Table 2). This instni- 
ment had a coefficient alpha reliabil- 
ity of 0.79. The pretest to posnest 
correlation was .68. 

Additioaml MeMsares. All subjects 
took a 10-item. open-ended oral test 

fWhat does mean?!. The test 

was designed to measure the studenf s 
abiUty to recall rht definition of the 
words tau^t The training only in- 
volved recogrution. a much easier re-- 
sponse mode. The tester wrote down as 
much of the subjects responses as 
possible and also audiotaped the test 
For each word the subject was asked to 
give a definition and to use the word in 
a sentence. After a subject met mastery 
criterion (or after session 11 for those 
subjects who did not meet nustery), a 
test consisting of three passages ar 1 
accompanying inferential compre- 
hension questions was admirustered. 
These passages contained a total of 10 
verbs which were most frequendy 
missed on the pictest The passages 
were designed to asv-ss subjects' un- 
derstanding of rht wo.-ds in contexts 
otiiertiian die sample scatencesv^ftich 
appeared in the progrr.ms. 

An example fmm the comprehen- 
sion test appears in Table 3. 

. Attitude Surrey. The attimde survey 
questioned students about working on 
the computer and the specific CAI 
programs. The items asked the stu- 
dents how they felt about working on 
the computer and how much they felt 
they learned. 
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TabNi3 

S«mpte Ittm from C o m prth»nt ion tdtt 

Dents« •n|oy«d h^r t>«cK yard. In me lail. 
m« yard was covt rad with iaavat. Otrusa 
had procrastma:ad. Saturday was coot and 
cnsp. 0«n«a dtctdad to raka tht laavaa. At 
(ifst her hand fait cold and s:itf on the rake 
handle. Soon, she acdimaiad. She enjoyed 
the clear, sunny skiea and the rustle of the 



1. Oentse raked the leaves m her yard 

a. before she was supposed to. 

b. just when she was supposed to. 

c. after she was supposed to. 
Z When Oenise finished raking. 

a. her hands were still stiff . 

b. her hands felt fine, 

c. her hands were hot 

3. Oenise enjoyed the clear, sunny skies 
and 

a. the sound of the leaves. 

b. iht fall colors. 

c. the smell of the leaves. 



RESULTS 
Time to Mastery 



A timc-to-mastcry design was used 
to examine whether there was a sig- 
nificant difference between tlie times 
required to reach mastery by subjects 
in the two programs, "^aole 4 presents a 
summary of the number and percent- 
ages 0 f subjects who met mastery with- 
in 1 1 sessions and the mean number of 
sessions to mastery for subjects in both 
treatments. The study was terminated 
after the 11th session because the ex- 
perimenter felt that the subjects who 
were still .struggling to reach mastery 
were no Icngcr benefiting from in- 
sirucrioa. Fcr the students who met 
criteria, the rr»tan number of sessions 
to mastery was 7.6 for subjects in the 
Small Teaching Set program and 9.1 
for subjects in the Large Teaching Set 
program. Results of a /-test indicate 
that this difference in sessions to mas- 
tery is statistically significant, r(16) - 
1.87, p < .05. 

Posttest and Maintenance Test 

The 50-item, criterion-referenced, 
multiple choice test was administered 
to each subject as a posttest after meet- 
ing mastery.. Those subjects who did 
not meet mastery by the end of the 1 1 th 
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session were administered the multi- 
ple choice test after session 11. The 
same test was re-administered (with 
the order of items changed in a ran- 
dom fashion) as a maintenance test 2 
weeks later. A summary of pretest 
posttest and maintenance test n^sulis 
is presented in Table S. 

A 2 X 2 analysis of variance was 
performed on the posttest and main- 
tenance test scores (sec Note 1). TTie 
betwcen-subjccts factor was type of in- 
structioit and the within-subjects fac- 
tor was time of testing. Results of the 
ANOVA indicated that there was no 
effect for type of instruction, « 
0J3. Results of the ANOVA demon- 
strated that there was a slight drop in 
performance between posttest and 
maintenance test for subjects in both 
groups.F„^ « 4.94.P < .05. Mean per- 
formance was close to mastery level 
for both groups on both measures. 84% 
to 87% on the posttest and 81% to 84% 
on the maintenance test Subjects in 
both programs learned as much, as 
measured by the cntcrion-referenced 
posttest and retained as much, as 
measured by the maintenance test 



Additional Measures 

Each subject was adminif tered 
open-ended oral test on wonJ mean- 
ings after session seven. A maximum 
of 2 points was awarded to each item. 



1 for a conect definition and I for a 
appropriate sentence. Panial credit ( 
point) was given to responses whii. 
were correct but incomplete. Results • 
a r-test on subjects* scores on th 
measure indicated that differences bi 
tween groups were nonsignificar. 
f(22) ■ .45. The mean was 6.4 for tb 
Small Teaching Set group and 7.2 fc 
the Large Teaching Set group: star 
dard deviations were 4.7 and 4.4. n 
spectively. On the written compreher 
sion test means were 5.1 (SD » 1.4) fc 
the Small Set and 4.7 (SD « 11) for th 
Large Set This difference was nc 
significant 

Attitude Sunrey 

Results of the attiwde survey ir 
dicated that for the most part subjec* 
responded favorably toward corr 
puter-assisted instruction and the pn 
grams. Twenty-three of twenty-fou 
subjects felt the computer helped ther 
leam new words, and one subject it 
dicated that **maybc" the compute 
helped 

In answer to the question •'Did yo 
enjoy working on the computer^ sub 
jects answered on a 4-point scale, wiii 
1 being **not very much" and 4 t-in 
"very much^The mean scores were 3. 
for subjects in the Small Teaching Se 
program and 18 for subjects in th* 
Large Teaching Set program. Result 



P*rc«nt«9« of Subftctt Reaching MMl«ry WttNn 11 SMtioM and M««n Numbtr of 







Numbtf of 


Ptfoantage 










Subiecta 


of Subieeta 


Mean Number 








Reaching 


Reaching 


. • Seaaione 


SUndard 


Group 


N 


Maatery 


lyiastenr 


fo Maatery 


Deviation 


Sm«l) Teaching Set 


12 


10 


83 


7.6 


1.9 


Large Teaching Set 


12 


8 


67 


9.1 


1.5 



Tables 

Mean Scof«» SUndard Deviation, and Mean Percentage Correct on SO^tem Preteat, 
Poetteat, and Maintenance Teata 



Group 



SD Mean % 



Poetteat 
50 Mean % 



Maintenanot Teat 
M SO Mean% 



SmailSet 12 24 7 8.1 49.4 42.0 4.0 84.0 
Large ^3t 12 24.75 7.8 49.5 43.7 7.7 87.8 



40.5 7.1 81.0 
42.0 8.7 84.0 



Journal of Uamm^ Dtsabihf* 



43 



of a Mann-Whitncy U Test indicate 
that this difference was significant U 
m AZXp < .01. Nineteen subjects in- 
dicated they would like to learn more 
on a computer, and three subjects in- 
dicated that '•maybe" they would Two 
subjects, both in the Large Teaching 
Set program, indicated they would not 
like to learn more on a computer. 

Comparison with 
Nonhandicapped Subjects 

One purpose of this study was to see 
if, with special instruction, smdents 
with learning disabilities could per- 
form at the same level as smdents 
without learning difficulties. Thus, the 
50-item multiple choice test was ad- 
ministered to nonhandicapped 10th* 
grade smdents in a randomly selected 
English class in order to provide a 
comparison standard. As Table 6 dem- 
onstrates, the mean posttest scores of 
the mildly handicapped subjects were 
at a similar level to the nonhandi- 
capped smdents' mean scores. The LD 
smdents' scores were 84% and 87.4%, 
the nonhandicapped smdents' mean 
was 80.6%. Despite a significant dif- 
ference in reading ability, as mea- 
sured by the Metropolir..^ Achieve- 
ment Test (Prescottet aU 1978) and the 
Woodcock-Johnson (Woodcock ic 
Johnson, 1977), these LD smdents 
were able to learn word meanings so 
that, in this instance, they could per- 
form on a level similar to that of their 
nonhandicapped peers. The difTer* 
ence in performance between the LD 
groups and the nonhandicapped 
group was not significanL This result 
indicates thaL widi intelligent use of 
CAI, the LD smdents performed at a 
level similar to that of their nonhandi- 
capped peers. 

DISCUSSION 

The results of this comparison of 
two methods of computer-assisted vo- 
cabulary instruction with mildly 
handicapped high school smdents 
will be discussed in terms of (a) time 
! required to reach mastery criterion, (b) 
: growth in word knowledge, (c) transfer 
learning, and (d) smdeni attimdes 
gl^Q^vard computer-assisted instruction. 
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Time to MMStery. In previous smd- 
ies which attempted to improve sm- 
dents* word knowledge through the 
direa teaching of word meanings, the 
effects of various instructional pro* 
cedures were compared. Those smdies 
which demonstrated sizable gains did 
so at a large expense of instructional 
time. This smdy was the first to focus 
on efficiency as a dependent variable. 

The one unequivocal finding of the 
smdy was that subjects taught with the 
Small Teaching Set program reached 
mastery criterion on the set of 50 words 
significandy faster than subjects 
uught with the Large Teaching Set 
program* Also, more smdents in the 
Small Teaching Set program reached 
mastery within 11 sessions. 

Given that the groups achieved 
equivalent levels of performance on 
the multiple choice tests, their dif- 
ference in acquisition rates becomes 
even more meaningful Subjects 
taught with the Small Teaching Set 
program required less time to meet 
mastery criterittn on the words, yet 
iheir posttest performance and reten- 
tion was equal to that of subjects in the 
other treatment These findings have 
impr^rtant implications for teachers of 
low performing or reading disabled 
smdents. An efficient, compute^ 
as5isted method of vocabulary instruc- 
tion could provide an additional tool 
for teaching vocabulary, without plac- 
ing further burdeiis on teachers' time. 

Growth in Word Knowledge. The 
growth in word knowledge evidenced 
by both groups provides encouraging 
support for the use of computer- 
assisted instruction in vocnbulary with 
mildly handicapped smdents. Each 
group started with a pretest mean 



score of about 50%; after seven 20- 
minute sessiofu* each group's mean 
score was around 80% (Johnson, 1985). 
When subjects were tested after reach- 
ing the mastery criterion determined 
for their program, or after 11 sessions 
for those 6 subjects who did not reach 
mastery criterion, each group's mean 
score was around 85%. These scores 
reflected a commonly accepted min- 
imum mastery level: approximately 
85% correct Finally, on the main* 
tenance test, administered 2 weeks 
later, each group's mean score was 
above 80V Although the drop be- 
tween posttest and nuintetunoe test 
was sutistically significant, 80% is still 
a high level especially considering 
that subjects began at a 50% level 

Perfomumec on MtBSures Requiring 
Altemntife Response Formats. Sub- 
jects' low scores on the two transfer 
measures should not be surprising. 
Smdents were uught with a multiple 
choice format, requiring recognition. • 
The open-endea oral test of word 
meanings is a more difficult usk. The 
test of passage comprehension also re- 
quired response modes considerably 
more difficult than the multiple choice 
response mode of the CAI programs. 
While subjects scored 85% conea on 
the multiple choice posnest, they 
scored approximately 35% on the 
open-ended oral test of word mean- 
ings and 50% on the comprehencion 
test Lack- of specific training on the 
kinds of usks tested by the transfer 
measures was the primary reason for 
subjects' low scores. Without specific 
training on the response characteris- 
tics of transfer tasks, mildly handi- 
capped adolescents often fail to gen- 
eralize academic skills (Alley, Desh- 
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ler. Clark* Schumaken & Warner, 
1983). The implication is clear Sni* 
dents need training in transfer of skills 
learned in CAI formats. 

Studm Attitudes Towmtd Instrue* 
tion. On the attimde survey, most sub* 
jects indicated they enjoyed computer* 
assisted instruction and the CAI 
programs. When asked to indicate 
what they specifically liked about 
working on the computer, peiiiaps the 
most telling response was, ''It helps 
keep your mind on what you were (sic| 
doing.** Subjects* positive response to 
computer*assisted instruction lends 
credence to the claims of Budofi; 
Thormann, and Gras (1984) that ad* 
vantages of CAI with special educa* 
tion students include increased anen- 
tion« immediate feedback about 
performance, immediate reinforce* 
ment, and motivation. 

On the question ''Did you enjoy 
working on the computer?** subjects 
rated the SruII Teaching Set program 
significantly higher, as results of a 
Mann-Whitney Test demonstrated 
This finding is interesting as it relates 
to the design of CAI programs. While 
the Large Teaching Set program had 
an -arcade** type game, the Small 
Teaching Set was designed to foster 
rapid learning. During the study, some 
subjects in the Small Teaching Set pro- 
gram occasionally asked the experi- 
menter ^y they didn*t get to play a 
game like the one in the other pro- 
gram. The experimenter wondered if 
this difference in programs might bias 
the subjects against the Small Teach- 
ing Set program. The results indicated, 
however, that the subjects in the Small 
Teaching Set pro>, am, which tailored 
lessons to their individual learning 
needs, rated that program more 
highly. 

This fmding is important for those 
designers of CAI programs who ap- 
parently believe that for educational 
software to be motivating, it must ap- 
proximate computer games which are 
popular in video arcades and in the 
home video market. Results of the at- 
titude survey in this study do not sup- 
pon such reasoning. 

Suggestions for Future Research. 
^ This smdy contrasted two packaged 
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CAI programs. The major diflerences 
between the programs related to the 
size of the teaching and practice sets 
and the procedures for individualiza- 
tion and cumulative review. Ye^ other 
subtle diflerences between the soft- 
ware programs may have aflected the 
outcomes. Since an eflect for time to 
mastery was clearly demonstrated for 
the Small Teaching Set program, and 
posttest performance levels were 
equivalent for the two groups, fumre 
research might focus on only the 
Small Teaching Set program. By vary- 
ing the size of teaching and practice 
sets, and by comparing diflerent 
schedules for cumulative review exer- 
cises, more exact eflects of these vari- 
ables could be measured 

Although most subjects with learn- 
ing disabilities learned from the CAI 
programs, as expected, performance 
levels were low on the transfer mea- 
sua's. If disabled readers are to benefit 
from computer-assisted vocabulary 
instruction, future studies need to in- 
vestigate the effects of integrating 
computer-assisted and teacher direa- 
ed instruction. This integration should 
follow some of the principles articu- 
lated by Beck et al. (1982), although, 
hopefully, in a more efficient fashion. 
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NOTE 

I. The pretest was not included in the ASOV 
the following reason. The time between 
posttest and the maintenance test was the 
for all subjects (2 weeksi but the time hrt* 
pretest ond posttest was not the some fordiff* 
subjectt Thus repeated measure could nt* 
used 
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Figure Caption 

Figure 1. Example of a computer display of a valid argument. 
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Elaborated Corrective Feedback and t;he Acquisition of Reasoning Skills 
A Study of Computer-Assisted Instruction 

Much of the recent literature on improving special education teaching prac- 
tices has stressed the importance of providing specific academic feedback to 
students when they make errors (Carnine, 1980! Peith, Polsgrove & Semmel , 1981; 
Stevens S Roseshine, 1981). Correlational studies have shown that more effec- 
tive teachers provide immediate corrective feedback to students when errors are 
made (Englert, 1984; Gersten, Carnine & Williams, 1982). Furthermore, effec- 
tive teachers offer students infonr.ation on strategy or process that can be used 
to deduce the correct answer (Englert, 1984; Glaason, et al., 1985; Good & 
Grouws, 1977; Stallings, 1975). However, relatively little experimental 
research has evaluated the impact of the type of corrective feedback provided to 
learners. 

A meta-analysis of the limited research on corrective feedback by Ly^^kowski 
and Waiberg (1922) suggests that detailed corrective feedback is superior to 
merely telling students whether their answers are right or wrong. These authors 
iuggest that students need to see an overt model of all the steps necessary for 
an appropriate response. By observing a model of all the steps necessary in 
obtaining a correct response, students receive detailed information on how to 
solve the prcblerr.. This procedural knowledge should be of use when they 
encounter similar types of problems. Merely telling students they are wrong, 
does not. help them to solve the problem correctly. 

The corrective feedback provided in these various studies can be placed in 
one of three groups: a) minimal feedback (telling students if their answers are 
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correct or Incorrect, b) basic feedback (telling students if their response are 
correct, and, if incorrect, supplying the correct answer) and c) elaborated 
corrective feedback (telling students if their responses are correct, and, if 
correct, supplying the correct answer and providing additional information about 
the correct answer). Elaborated corrective ^°edback is similar to the correc- 
tion procedure Lysakowski and Wa'.uerg (1982) advocate, i.e., detailed feedback 
on how tc correctly solve the problem. Only two studies have addressed ela- 
bcrataa corrective feedback directly with a handicapped population. The results 
of t.*i2 resaarch have been mixed. 

A longitudinal study by Siegel and Crawford (1983) examined the effects 
of elaooratad feedback on mentally retarded students selected from both elemen- 
tary and secondary school settings. The researchers compared students taught 
with eldoorated corrective feedback with students trained on the discrimination 
tasks without any corrective feslback. The elaborated corrections group 
:rr::ns;.-s:2i: suoerior results over the no feedback group on a transfer test 
:iY-2r. sr.cr-.ly aftsr training as well as on the two-year follow-up tas-. 
Howevi-r, this study did not compare the effects of elaborated feedback with 
basic feedback in which s-.udents are told whether their response is correct, 
and, if correct, are told the correct response. 

Only one study, by Meyer (1982), addressed tnis issue witn handicapped learn- 
ers. Meyer examined the relative effectiveness of elaborated correction proce- 
dures and basic correction procedures in teaching word attack skills to two 
groups of learning disabled students in the secondary grades. The only dif- 
ference between the two groups was that, when students in the basic correction 
group made an oral reading error, the teacher told them that they were incorrect 
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and then sounded out the word correctly for them. In the elaborated correction 
group, when student made errors, the teacher reviewed the appropriate decoding 
rules, and then had the student sound out the word with her or him. Meyer 
reported no significant difference in oral reading performance between students 
In the two groups. Her findings are different than those of Carnine (1980), in 
which non-handicapped pre-schoolers who received elaborated correction proce- 
dures were able to decode more new wcris t.ian students who received basic 
corrections. 

There are two possible reasons ::r tnis anomaly. .This first is that the 
error rate was so low in Meyer's (1582) stuay-less than 4 percent— that ela- 
borated corrective feedback may not fiave been necessary. The few errors may 
have been due to temporary inattentiveness, slips of the tongue, rather than 
failure to know— and know how to apply the word atta rules. The second 
possible explanation relates to the difficulty of the skill to be taught. 
Grantj McAvcy, & Kesnon (15H2) irrje :r-.a: slaoorated corrective feedback is 
only necessary for cognitive skills tr.at are new and/or complex for the learner. 

The word attack skills for sounding out CVC and CVCe words were hardly new, 
or terribly complex to the high school students, many of whom had bpen exposed 
to similar rules for eight years. On the other hand, these same phonics appli- 
cations to word reading would seem complex to preschoolers (Carnine, 1980). 

The current study, like the Meyer study, compared the effects of elaborate 
corrective feedback and basic feedback in teaching remedial and learning disable 
students at the high school level. However, in this case, a complex cognitive 
task, inferential logic, was selected. 

Formal logic was selected as the content for studying feedback because of 
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its complexity. Also, Instruction In logic and reasoning skills has been 
generally lacking in the curricula of both "regular" and "special education" 
students In elementary and junior-high settings (Cnerkes, 1979). The difficulty 
in teaching reasoning skills may account for Its absence from the curriculum. A 
major purpose of the study was then to evaluate whether remedl^.l and learning 
disabled secondary level students coulc*. learn logical analysis skills from 
computer-assisted Instruction. 

However, the primary Intent of the study was to examine whether low- 
performing students who receive elaborated correction procedures would perfora 
significantly better than students provided basic corrections. We also 
examined any differences regarding acquisition time between students as well as 
attltvdes toward the instruction. 

Method 

Subjects 

Subjects were selected from six remedial (Chapter One) and special ecucaticn 
classrooms in two secondary schools in western Oregon. To qualify for the 
study, subjects had to demonstrate: a) a reading comprehension deficiency of no 
more than three years on district-administered, standardized reading comprehen- 
sion tests; b) at leas: fiftn grade oral reading level, as detei-niined by teacher 
judgment; and, c) an understanding of the concept of large and small classes, as 
evidenced by passing an experimenter-developed classification pretest. (See 
Measures section.) Reading comprehension scores were considered to Insure that 
subjects did not have a severe problem with reading comprehension. If -eading 
comprehension was a serious problem, instruction would' need to focus on reading 

TE2 



8 



comprehension, rather than reasoning skills. 

One-hundred and eighteen secondary students in remedial an<l special educa- 
tion were screened. Of the 2S whc completed the study, 13 were Isan^ing disabled 
and 15 were remedial. The Woodcock Reading Mastery Test (1978) wfs admini- 
stered to all 34 subjects. The subjects were matched on scores from the Word 
Comprehension r-ubtest of the Woodcock Reading Mastery Test and then randomly 
assigned to the Basic Correction or Elaborated Correction group (Cook and 
Campbell, 1979). Mean scores for the two groups on the screening test were vir- 
tually identical, 23.67 (SD » 1.15) for the Elaborated Corrective Feedback Group 
and 23.35 (SD » 1.28) for the Basic Corrective Feec^ack group. Mean scares on 
the Woodcock Word Comprehension Subtest were identical for both groups. Both 
were 35.79. (Standard Deviations were 6.6 and 8.7 respectively). 
Materials 

The Reasoning SkHls computer-assisted instruction program (Engelmann, 
Gamine & Collins, 1983) wa? designed to teach students two major objectives: 
a) to draw conclusions from two statements of evidence and b) to determine 
whether a thre'i-itaiement argument was logical or illogical. To enter the program 
students had to have class Inclusion skills, i.e., place members of a smaller 
class in a larger classes (e.g., car fitsin the class of vehicle, fox t^irrifr 
fits in the class of dog). The program then taught students about overlapping 
classes (e.g., tall things and buildings) and non-overlapping classes le.g. 
beys and girls). 

• Students learned three key words (all, sone, no) and their corresponding 
class relationships: 

for all the smaller c'ass is included In the larger class 
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for some the classes overlap 
for no the cla'ses do not overlap. 
Table 1 presents cases of £io and some arguments. 

Insert Table 1 about here 

The following three rules formed the basis for the first phase of instruc- 
tion — constructing arguments. The first rule is: 

"Start the conclusion with the key word . Before learning this rule, stu- 
dents learn what a key word is. They learn that in a syllogistic argument, 
one of the statements of evidence begins with the word all . The word at the 
beginning of the other statement is the key word. The key word becomes the 
first word in the conclusion (see figure 1). 

Insert Figure 1 about here 

In a conclusion, the key wora is followea by two classes. The second rule 
tells students how to identify the two classes named in a conclusion: 

"The conclusion names the classes that are named only once in the evidence . " 
In any syllogistic argument, one class is always named twi:e in the evidence. 
In Figure 1, men is named twice. The once-named classes ("humans" and "those 
witn brown hair") are uncerllned Figure 1. Tr.ese are the two classes tnat 
appear in the conclusion. 

The third rule applies to a subgroup of conclusions, only those with all as 
the key word. An aJJ[ statement always places a sma*»er class in a larger class; 
thus a conclusion beginning with aVI^ must name the smaller class first. (Class 
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order isn'w important for some and no conclusions beginning with some or no). In 
the following aJJ[ statements, a small class is placed in a larger class: 

All men are animals. 

All animals are living things 
Men is a snialler class tnan animals . Animals is a smaller class than living 
things . The small class must be named at the beginning of the ari_ statement. 
Thus, in tne ccnclusion {"All men are living things") small class, men, must 
be named first. 

Tne'for.sw'ng example is taken from the program. It includes the Esaocrated 
and tne Basic Corrections provided when an error was made. 

All athl etes are humans . 
All football players are athletes 
Question : Enter the number of the smallest class in the evidence. 

The correct answer is football player . If student responses were a) 
stnl 2:2s 3- -) humans, a correction procedure was supplied. 

Basic Correction : "The correct answer is football players." 
;-.a=or3ted Correction : The smallest class is named once and is named at 
the beginning of an ari_ rtafement. Football players is named once and 
is named at the beginning of an all statement, so football players is 
tne smallest class. 
This type of correction may appear contrived at first, yet is, we believe, 
essential to providing students with practice on the appropriate form of a logi- 
cal argument. • 

The otner major objective of the program was to teach students to identify 
unsound arguments. This is discussed in detail in Collins (1984). For logi- 
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c?ny unsound arguments, students were taught to specify one of three reasons 
why an argument Is unsound* 

In the study, the E laborated Correction treatment used an unaltered copy 
of the Reasoning Skills program. The Basic Correction treatinent used a modified 
version of the program. In both conditions, students worked individually on an 
Apple He microcomputer. In the Basic Corrections program, if students made 
errors, they were given the correct answer, as s?ecif:ec above. This was the 
only difference between the two conditions. 

One feature that was the same for both trest-i^ents was a provision for review 
of missed Items. When students missed a question, tne program presented the 
question again later In the lesson. Thus, all missec questions were answered 
correctly before the end of a lesson. 

Measures 

Screening Measure and Pretest s 

The group-administered screening t<iSt was ces'ignec to cerar.'Tiine whether sub* 
jects had any background In specifying wnicn ooj-^c's fit into a particular 
class. For example, which of the following are venicles (circle two) a) cars; 
b) toy dolls; c) airplanes; d) windmills. The test also assessed whether stu- 
dents had mastered the concepts of larger and smaller classes (f;.g., which class 
is larger, the cUss of trucks or the class of venicles?), Stu^erts who scored 
less than 90 percent on this measure were excluded from the study. Knowledge of 
classification Is a prerequisite skill for the reasoning skills program, because 
large and small class information Is used in the deductive process. 

The Word Comprehension subtest of the Woodcock Reading Mastery Test was 
given individually to each st'jdent prior to the study." The subtest contains 70 
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items in analogy form. Sol it^hal f rel iabU ities range from .83 to .96, 
depending on grade level. The median concurrent validity correlation (with 
'ither subtests In the instrument) is .86. This subtest contains 70 items in 
analogy form. Since this subtest required an understanding of classification, a 
skill requirement for the CAI program, subjects were matched according to the 
scores on this subtest prior to random assignment to one of the two treatment 

groups. : 
Students were also given Form / jf the Test of Formal Logic (see below) as a ^ 

pretest. 

Dependent Measures 
These included: 

a) a test of formal logic measuring acquisition of the CAI progam's con- 
tent, 

b) a transfer test, designed co assess generalization of subjects* skills 
to new .'naterial , and 

c) an attitude survey, measuring students' attitudes toward the computer- 
assisted instruction and the reasoning program. 

Test of Formal Logic (Collins, 1984). This was the primary dependent 
measure in the study. The purpose of this test was to measure a student's abil- 
ity to construct arc analyze syllogistic arguments. Two alternate forms of the 
test were designed. Form A was used as a pretest measure and then read- 
ministered two weeks after treatment as a maintenance test. Form B was given to 
all subjects immediately following the treatment as a post-test measure. 

Fifteen items tested the skills for drawing conclusions from stated evi- 
dence. The following item is an example: 
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Here's the evidence: 

All trains are vehicles. 
Some trains are steel objects. 
1. What win be the first word in the conclusion (all, fme or no)? 



2. Write the conclusion: 

In the remaining eight items, students analyzed a syllogism ana determined if it 
was "sound" or "not sound," and if "not sound", explained why. 

The content val iditv of the two forms of the Test of Formal Logic was 
assessed by four university instructors and 15 teachers in a graduate level 
college class. These persons were choser because they were considered poten- 
tial "users" of the program. These teachers examined the items and indicated 
those items which they felt were inappropriate and their reasons for exclusion. 
Based on tneir comments, these items were either dropped or revised. 

The internal c onsistency reliability (coefficient alpha) for the instrument 
was .90 for Form A and .91 for Form B. This was based on a sample of 23 stu- 
dents not included in the experimental study. Parallel form reliability between 
Forms A and B was .84. 
Transfer Test 

This IS-itsm test evaluated subjects' abilities to generalize what they had 
learned on the computer to similar analytical tasks, but in prose paragraph 
form. The transfer test was 

devoted to the more difficult objective of the program — deciding whether argu- 
ments were sound, and, if not sound, giving a reason. Two examples of these 



TE2 



14 



arguments appea^ bel^w: 

A. Michael and Sam were sitting in the Science Lab at school. 

Michael had a bottle of arsenic in his hand. Sam said, "Hey, you know that 
all poisons are dangerous!" Michael looked at the bottle of arsenic. 
Michael said, "It says on this bottle that arsenic is a poison. So, arsenic 
must be dangerous!" "Yes, I guess so", replied Sam. 

B. It is vrry easy to describe high school students in America. All 
high school students are people who want to drive cars. They always talk 
about borrowing their parents' car and driving around town. We can also say 
that all high school students are people who have to take tests. I wonder 
why all people who have to take tests also want to drive cars. 

The transfer test was given to subjects on the day following the completion of 
training on the CAI program. Both invalid and valid arguments were included, 
with a heavier emphasis on invalid a^^guments (5 valid and 10 invalid arguments). 
The argun ^nt forms parallelea tncse arcur.ents taught in the reasoning skills 
program. 

Results 

Test of Formal Logic 

Table 2 presents the descriptive statistics for the pretest, the post and 
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maintenance administrations of the Te'* of Formal Logic. The Elaborated 
Corrective Feedback group had a slightly lower mean pretest score than the Basic 
Corrective Feedback group on the pretest; this ciifference was not significant. 
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The posttest mean for the Elaborated Correction group was 13.0 and for the Basic 
Correction group 15.2. On the maintenance test, respective means were 18.7 and 

13.2. 

A 2x3 analysis of variance with one between subjects factor (Type of 
Correction) and one within subjects factor (Time of Test) was performed on the 
data* A planned comparison was utilized, contrasting performance of the two 
grcuos on both the post-treatment measures (immediate posttest and maintenance 
tes-). The ANOVA indicated a significant difference favoring the Elaborated 
Corracticns group; -gj = 9.37; d <.001. This result indicates a significant 
effect for tne use of elaborated corrective feedback on both immediate posttest 
ana niaintananca test. 
Transfer Test 

The mean score was 12.07 for the Elaborated Group and 10.29 for the Basic 
Group. Standard deviations were 2.97 and 2.56 respectively. These results indi- 

3 c-:-ificant difference between the two training groucs; t_/-c^ = 1'70, 
2<.:5, again favoring the Elaboratpd Corrective Feedback group. 
Time ?_er Lesson 

Data were collected on the time students took to complete each of the five 
lessons. The purpose of this analysis was to see whether students in the 
Elaboratad Corrections group took more tiir.e to complete the lessons. The 
average daily lesson time was 24.86 minutes for the Elaborated Corrective feed- 
back group and 23.93 for the basic group. A 2x5 analysis variance (ANOVA) with 
repeated measures was performed on the time-per-1 esson data. The between- 
subjects factor was type of corrective feedback training method. The within- 
subject factor was time per lesson for five lessons. The analysis did not show 
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any significant difference between the time the Elaborated group and the Basic 
group took to complete lessons. 
Student Attitudes Toward Instruction 

A survey was administered after the lessons to all students to determine 
whether any difference existed between the two groups on attitudes toward 
instruction. The items and mean scores on a U)cert Scale are presented In Table 
3. 

Table 3 Here 



There was only one significant difference between the two groups, on item 4, 
t_(26) = 5.14, £ <;.001. This item asked students how well they felt they could 
detect unsound arguments. Students in the Elasorataa group felt they were 
better in picking out a sound argument. Thus, the self-report parallels 
the performance data. 

Oi scjssion 

This sfjdy was the firs: to explor-j exseri.TK-r.iiily tne er'fectivenesr of ela- 
'borated corrective feedback in teaching a complex cognitive skill to handicapped 
learners. These results, unlike those of Meyer (1982), indicate this Is an 
effective instructional procedure. In the Meyer study, students were familiar 
•with zr\e pncr.ic rules. The difference in tne content of the two studies 
suggests elaborated correction procedures may be necessary only when students 
are learning new or complex rules or strategies. The merit of elaborated 
correction procedures is confirmed by the significantly higher scores on the 
transfer test and the greater confidence in identifying sound arguments for the 
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students in the Elaborated Corrections group. 

At first glance, though, the roughly equivalent times for the two treatments 
to complete the lessons seem anomalous. With more text to read in elaborated 
corrections, that treatment would seemingly take longer to complete the lessons. 
CompUcion times were not significantly greater for the elaborated corrections 
group, however. 

The extra time required to read the elaborated corrections may have been 
compensated for by faster acquisition of the material. As mentioned earlier, 
the computer would return a student to items that were missed earlier In a 
lesson. If elaborated corrections resulted in fewer mistakes, stuoents would 
spend less time returning to missed items. This Interpretation suggests that 
taking more time early in a complex instructional sequence to offer elaborated 
corrections may in fact lead to savings in instructional time later in a 
program. 

Both the basic and elaborated correction groups improvea their reasoning 
skills as measured by the dependent variable. The groups demonstrated a mean 
score of 68-70% on the posttest (a dramatic gain from the mean scores of 26 to 
34 percent on the pretest). The systen<atic design of the instruction — par- 
ticularly through a series carefully controlled rules--may have contributed 
to this gain. 

Well-designed computer programs could increase students* academic engaged 
time in two ways — by replacing some time spent in seatwork with Interac- 
tive instruction, which tends to have higher engagement rates (Rosenshine, 
1983) and by having students receive interactive instruction outside of 
school. These applications are particularly significant because the program was 
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a true tutorial. Reasonng s'kIIIs were acquired without any instruction from 
the teacher. Typically, CAI programs merely provide drill and practice exer- 
cises to supplement teacher instruction. Here the program did all the 
Instructing. This finding goes against prevalent research (Winkler, et al., 
1984) suggesting that properly designed CAI best serves as a supplement to con- 
ventional instruction. 

In this study, CAI succeeded at providing initial instruction. With 
suitable content (e.g., rule based) and viable instructional design (Engelmann & 
Carnina, 1982), CAI can play a much larger role in instruction. 
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Table 1. Examples of Syllogistic Arguments 

Example of an argument with "All" Statements of evidence : 

All men are human. 
Al 1 humans require oxygen. 
(So) AlJ_men require oxygen. 

Example of an arqu>;.ent with a "No" statement of evidence ; 

All men are human 
No birds are human. 
(So) No birds require oxygen. 
Example of an argument with a "Some" statement of evidence: 

All men are human. 
Some men are over six feet tall. 
(So) Some humans are over six feet tall. 
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Table 2 

Means. Standard Deviations and Mean Percent _of Items Correct on Criterion 
Re.ferenced Measure (Test of Formal Logic ) 







Time of Test 






Pre 


Post 


Maintenance 


Elaborated Corrections 


{N^ * 14) 






M 


6.0 


18.0 


18.7 


SO 


3.67 


4.69 


3.97 


Me£n Percent Correct 


25.1 


78.3 


81.3 


Basic Corrections (N^ = 


14) 






M 


7.0 


15.2 


13.2 


SO 


3.55 


6.14 


6.81 


Mean Percent Correct 


30.4 


55.1 


*7 i 
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Table 3 

Student Attitudes Toward the Program 



Group 



Questions . 

Elaborated Basic 
Correct 1 ons Correcti ons 







M 


SD 


M 


SO 


1. 


Was this program intareszinc tr. ycu? 
(l-yes 0-no) 


.79 


.43 


.86 


.36 


2. 


How well did you enjoy the program? 
(2-1 enjoyed it l-I feel OK about 
th<? program O-I did not enjoy the 
program) 


1.57 


.51 


1.36 


.53 . 


3. 


Do you feel you understood the rules 
and examples In the program? 
(2-an the time 1-sometinies 0-never) 


1.29 


.47 


1.20 


.58 


4. 


Do you feel like ycu can new ?icK out 
a sound argument? 

(2-all the time l-sometimss G-^ever) 


1.43 


.51 


1.07 


.27 
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Conclusion 



la. All men are 
human. 



Human 



Men 



lb. Some men have 
brown hair. 



M.cn 



■Hair 



1c. Some humans have 
brown hair, 
or. 

Some brown haired 
things are human. 



Human 



Brown I 
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Secc:idary students spend a considerable amount of their Ume completing 
application-oriented activities. In performing these tasks, students are asked to make 
a variety of inferences about a subject area by pn'dently using facts, concepts, and 
strategies or problem solving skills. Unfortunately, ii is easier to just teach students 
rote memory information and procedural knowledge (i.e the lUfiial algorithms used 
in solving a problem) than the comprehension and problem solving skills called foi in 
many application items (Doyle,l983). 

Some writers (Cherryholmes, 1966; Greenblat & Duke, 1975; Cruickshank & 
Tefler, 1980; Budoff, Thormann, & Gras, 1984 ) have suggested that one way to 
enhance these kinds of cognitive skills is through educational simulations. 
Simulations are thought to increase student participation (Boocock & Schild, 1968; 
Farran, 1968; Stembler, 1975) and allow low achieving students the much net ded 
practice in applying what they've learned to new situations (Cohen & Bradley, 1977). 

Yet the results of research on educational simulations, on the whole, have been 
discouraging. After an extensive review of studies conducted in the early sixties, 
Cherryholmes (1966) found that the effects of simulations were no greater than 
conventional instruction. Students neither learned more facts and concepts than 
»hey did in conventional instmction, nor did they show the anticip^ d increases in 
critical thinking and protlem solving. A more recent review of simulation research by 
Pierfy (1977) reached conclusions similar to those of Cherryholmes. At best, the 
results have been mixed regarding the effects cf educational simulations (McHenr/. 
1969; Livingston, 1971; Greenblat, 1973; DeNike, 1976; Pierfy, 1977; Jackson. 1979; 
Breiemeier & Greenblat, 1981; McKenzie & Padilla, 1984). Current research in 
computer simulations and other computer enrichment activities, although limited, 
also indicates little support for these techniques (Bangert-Drowr.j, Kuiik, & Kulik, 
1985; Waugh, 1986, 



Much of the research on simulations has been plagued by fundamental 
weaknesses in research design. Several interventions have been a.uch too brief, 
usually with otily one play of the simulation (Boocock & Schild, 1968; Fletcher, 1971; 
Pierfy, 1977). Quite a few studies {Livingston, 1970; Emery and Enger, 1972; 
Fennessey et al., 1973; Brenestunl, 1975) used rather crude quasi-experimentai 
designs involving intact classes being assigned to treatments on a non-random 
basis. In some cases, the intent of the simulation games, and hence the research 
hypotheses, were pooriy formulated (Williams, 1980). 

F-nally, there are problems with criterion measures used in many simulation 
studies. Of the 22 studies reviewed by Pierfy (1977), virtually all of them were 
investigator-developea criterion measures, with very little detail about the 
construction of the measures or their reliability and validity. By failing to thoroughly 
analyze the instructional goals of simulations, some researchers did not design 
measures to capture everything taught by the games (Megany, 1977). 

The problems with simulation research, however, go well beyond research 
design and instrumentation problems. Fletcher (1971) noted that many of the 
simulations used in the research had never been field tested, and thus were of 
unknown quality. He cited the great variation in the quality of the games used { in 
terms of complexity, levels of sophistication, and interaction among participants) as a 
major source of the weak results reported. 

The study reported here aiter'pted to remedy many of the problems and issues 
cited above. The obvious problems regarding the length of intervention and random 
assignment were relatively easy to avoid. Also, validity and reliability issues cf the 
measures used are addressed. We attempted to created an instrument that reflected 
the problem solving skills actually taught in the simulation.. The intent and 
instructional goals of the simulation HfialUlMya were formalized into a set of 
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specific stwti^gies, all monitored by a meta-stratagy. Figure 1 represents the 
problem solving strategies used in the simulation. 

Insert Rgure 1 about here 



The simulation itself was fie'd tested in several junior high school classes and 
with students of varied abilities. Problematic features of the simulation were refined 
after each field test. Other components of the intervention, which will be discussed in 
the methods section, were pilot tested before the study. 

The present study also attempted to address a curious feature of most past 
simulation studies. Virtually all studies have directly contrasted simulations to 
conventional teaching methods. Only on a f 3W occasions have the combination of 
conventional instruction and a simulation been compared to a conventional method 
alone. This is understandable with non-computer simulations, as time 
considerations (i.e., the number of days or weeks needed to fully implement a 
simulation) usually prohibited a combined intervention. 

With computer simulations, the situation is different. Tne effects of different 
variables can be demonstrated quickly (Doob, 1972), and less instructional time is 
required to demonstrates causal relationships and essential concepts. 
Unfortunately, very few experimental studies involving educational computer 
simulations have been documented. None had compared conventional instruction 
and a computer simulation with just conventional instruction. 

A final aspect sets the present study apart from eariier research. Conventional 
instruction »"as delivered according to principles derived from recent teacher 
effectiveness research (cf. Brophy & Good, 1986). Teacher modeling, high rates of 
teacher-student interaction, guided practice, and structured seatwork were used in 



ERJC 



4 

teaching oasic facts and concepts to both groups in this study. This enabled us to 
better gauge the additional effects the simulation had on basic facts and concepts. 

A secondary purpose of the study wcs to determine the extent to which a 
simulation could assist secondary learning disabled students in special education 
classrooms in acquiring factual information and problem solving skJIs. In adaition to 
evaluating these students' performance in absolute terms, their performance was 
compared to that of their nonhandicapped age mates who were enrolled in health 
education courses. 

Method 

Subjects 

All of the subjects were learning uisabled high school students eligible for 
special education services by federal and local standards. Because of the reading 
requirements of the simulation, students who scored lower than ihe sixth grade 
reading level, as measured by the Metropolitan Achievement Test (1970). were 
excluded as subjects. The thirty students who met these criteria participated in the 
study; an equal number of students in three classes were randomly assigned to 
either the conventional or simulation condition. 
Materials 

Health Wavs is a commercial software program developed by Carnine, Lang, 
and Wong (1985) for the Apple II and IBM PC computers. As a siinulation. Health 
Ways requires learners to manipulate several variables (e.g.. life threatening 
diseases, stress levels) in order to achieve an optimal life expectancy. Learners are 
presented a basic health profile which can be examined further by selecting various 
options. An example of such a profile is presented in Figure 2. For example, 
learners can inquire about eating habits by selecting ...d nutrition option. By doing 
this, they can see how much cholesterol, sugar, salts, etc.. the profile character 
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consumes. Learners must make changes in the most life threatening health habits 
(e.g., heavy smoking) and control other variables to be successful at a Health Wavs 
game. 
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We also developed Ihe Health Ways Supplementarv Curriculum - an 
accompanying written curriculum (Woodward & Gurney, 1985) that extended 
information presented in Health Ways and the original Health Wavs teachers guide. 
Information was taken directly from ^A/c widely used junior high school health 
textbooks. All of the information was rewritten to control for vocabulary and amount 
of new information. Clippings from newspapers, news magazines, journal articles, 
and health pamphlets were also used in the supplementary curriculum. Even though 
the supplementary Tiaterials relied heavily upon texts and other sources, 
considerable preparation time was required. Researchers estimate that it took 
approximately 70 hours to create the modified curriculum. The reading level of the 
curriculum, as determined by the Fry (1977) Readability Test, was approximately 
sixth grade. 
Pror° dures 

All students were instructed for 40 minutes per day for twelve days. The lesson 
consisted of two parts. The first part, called structured teaching , was identical for 
subjects in both conditions. A stmctured teaching method, following the model 
proposed by Rosenshine and Stevens (1984) and Brophy and Good (1986), was 
used for this segment. 

Stmctured Teaching. Instruction was conducted in a large group of 12 to 15 
students for this part of each lesson. Instruction began by reviewing essential 
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information from previous lessons. Students were then presented with a list of 
vocabulary words, which were essential to the day's lesson (e.g., cholesterol, 
diabetes). For the next fifteen minutes, each student independently read the two to 
three pages of text for that day's lesson and answered written comprehension 
questions. The teacher the.i discussed the answers with the group and presented a 
series of review questions covering the main points of the lessm. At the end of the 
initial instruction, students separated into twc groups: one which worked with the 
computer simulation (the Simulation Group) and one which workeo o" traditional 
application activities (the Conventional Group). 

The Conventional Group worthed in the rPoOurce room under the supervision of 
the resource room teacher, who presented these students with a variety of 
application or review activities. These exercises, typical of a high '^hool health 
education class, were reviewed for representativeness by two health teachers at the 
high school where the siudy took place. For example, the Conventional Group 
students kept track of their diets for three days and analyzed the;; cholesterol levels. 
Other exercises included analyzing one paragraph profiles of different individuals 
and diagnosing poor health habits. Students completed the exercises during the last 
twenty minutes of the period. 

Simulation Group students were taught in a computer lab, each student working 
individually at a microcomputer. Students in this condition wori<ed on the Health 
Ways simulation for twenty minutes each day. Instruction over the twelve day period 
was broken into three phases: initial modeling of the simulation tutorial and one 
simulation game (three days), guided practice on three simulation games (two days), 
and independent practice with individual feedback from the instructor (seven days). 

In the i nitial modeling p hase, the instructor modeled each component of an 
effective strategy for playing the Health Wavs games. The teacher modeled wori<ing 
on the most important health problems first, to control stress immediately if it rose to a 
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level that was too high, and to be sure to maintain changes by using the 
maintenance option. Each component of th =5trategy was modeled in isolation and 
then integrated in later instnjction. The most important instmctlon in the initial 
modeling phase invo'ved the orloritizing of health problems. Students were first 
taught to look at current disease and hereditary information. If the profile indicated a 
heredity of hypertension, this meant that the student should first look at the 
individual's diet and check the level of salt consumption. If it was high, this led to a 
con-elated change (i.e., an heredity of hypertension implies the need for a low salt 
diet, thus the student must change the character's salt intake). This habit was the first 
to be changed. The remaining habits were identifie'd in thoir order of priority (i.e. the 
second and third most important habits). 

In the guided practice phasg of instnjction, students were briefly taught in a 
group with one microcomputer. They were shown the initial screen in a Health Ways 
game profile, and individual students were asked to prioritize the first three health 
problems or habits and their correlated changes. Once correctly identified, the 
students were shown .nother profile. After three or four profiles, each student went 
to his or her computer in the lab and played the Health Ways games. 

The independent practice ohase allowed the students to play Health Way«^ 
games continuously for twenty minutes. The teacher circulated among the students, 
observing and commenting on a student's "play" of a game (i.e., how well he or she 
employed the strategy taught in the modeling phase of instnjction). 

All teaching was done by the researcher and a certified special education 
teacher. Assignment of teachers to treatment was counterbalanced, with the 
researcher and the teacher changing groups half way through the experiment. This 
was done to control for the effects of the teacher, a common problem that has flawed 
many computer based instnjction studies (Bangert-Drowns, et al., 1985). The 
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amount of total instructional time was controllec; in this study; both groups received 

the same amount of teaching and independent work. 

Measures 

Students were assessed one day, two days, and two weeks following 
instruction. On the first day, acquisition of basic facts and concepts taught in the 
cumculum was measured by the Nutrition and Disease Test. The Nutrition and 
-Disease Test was a 30 item test designed to measure students' retention of the 
important information contained in the written curriculum. Questions on the test were 
fill-in the blank, usually requiring only one or two word answers. The first 20 
questions were solely from the written curriculum. The remaining n were questions 
over material that appears in both the written curriculum and the Health Wavs 
simulation. Internal consistency reliability (coefficient alpha) of this measure is .84 
■)ased on a sample of 42 students. The Nutrition and Disease Test was given again 
two weeks after the instmction as a retention measure. 

The second measure, the Health Diagnosis Test, was a set of three written 
profiles administered two days after the instmction. This test measured the student's 
ability to detect important health problems facing an individual, identify and change 
related health habits, and control stress as it increased due to the health changes. 
Central to the Health Diagnosis Test was prioritizing health problems. For example, 
the test measured the student's ability to not orjiy identify, but order the health 
problems in terms of their Importancv, .o the individual's longevity. The Health 
Diagnosis Test has a test-retest reliability of .81. Table 1 presents an example of one 
cf the test's three profiles. 



Insert Table 1 about here 
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SCQnnq PrQcediJ£g&. For the Nutrition and Disease Test, only answers 
contained in the written curriculum or acceptable synonyms were considered correct. 
Subscale scores were obtained for two sections: a) those items reinforced and b) not 
reinforced by the Health Ways simulation. 

Special procedures for scoring the Health Diagnosis Test were developed. 
Three different areas were assessed in the measure: a) identifying important health 
problens, regardless of order, and making the appropriate correlated change b) the 
ability to prioritize health problems and c) ?kttending to stress when it was at a high 
level. 

Current health f?cts and statistics were used to develop criteria for correct 
prioritizing. The criteria state that the learner should attend to current disease and 
hereditary information in determining which health habits are most detrimental, 
hence, which habits need to be changed first. A strict and a moderate criteria were 
used to measure students' ability to prioritize. To score at the strict criteria, a student 
nnust change the three most important hv3alth problems in a specific order (i.e. the 
habit associated with the current disease first, the one associated with the hereditary 
disease second, and tho remaining detrimental habit third). These criteria were 
establibhbd by a committee consisting of the experimenter, a professional health 
educator, and two special education researchers. Students who scored at the 
nnoderate cnteria simply changed, in any order, the habits associated witii the current 
disease and horoHitafy problems within tne first three changes. 

In addition, both tests were also given to a random sample of non- 
handicapped high school students from health classes. Tests were given to tenth, 
eleventh and twehlh graders in introouctory and advanced health classes. Their 
scores were compared with those of the two grr jps that participated in the study. 
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fissuits 

The Nutrition and Disease Test 

A 2 X 2 (treatment by time of test) analysis of variance /ith repeated measures 
on one factor was performed on the total number of correct respons-s on the 
Nutrition and Disease Test. Table 2 provides the descriptive statistics for the correct 
number of responses for the post and maintenance tests for each group. Means 
were also converted to a percent correct. 



Insert Table 2 about here 



The analysis shows a significant main effect for instructional method; F^^ 28) = 

5.30, c < .03. For both treatment groups there was a significant drop in scores from 
post to maintenance test; E(1,26) = ■'6-23.Q< .001. No significant interaction was 

found. The simulation had a significant effect on mastery of key concepts in the unit; 
this effect was maintained over a two week period. 

Subscales analyses. The 30 item test was broken into two subscales: a) items 
reinforced by the Health Wavs simulation and b) items taught in the curriculum and 
not reinforred by the simulation. Separate 2x2 ANOVAs with repeated measures 
were performed on each subscale. The effect on items reinforced by Health Ways 
was significant; £ (i 28) ■» 40.02, ft < .01 . The effect for items not reinforced, however 
wai, nonsignificant; £(i 28) = 3.73, ft < .06. This demonstrates that the simulation 

was an effective vehicle for reviewing material that had already been presented in 
the written curriculum. 
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Health Diaonosis Test 

Scores for the conventional and simulation groups were compared on a) tho 
total test score and b) the total test score without stress as a factor. The reason for 
this was that the simulation group was explicitly taught ♦he relationship between a 
health change and an increase in stress through Health Wavs. Students in the 
conventional grr-jp were never taught about this relationship, thus it would be 
unlikely that these students would immediately control the stress level in the 
Diagnosis Test. Therefore, the factor of stress was removed from the analysis of the 
total test scores, which appears at the top of T?' 3. 

The three essential problem solving skills for the Health Diagnosis Test were 
independently compared: (1) prioritizing health habits (2) stress management and- 
(3) identifying health problems and making correlated changes. The t-tests 
demonstrate a significant difference between the two groups accordTPtg to all 
analyses demonstrating that the intervention had a consistent, significant effect on 
problem solving skills of the simulation students. 



Insert Table 3 about here 



The correlation between Metropolitan Achievement Test (MAT) Reading scores 
and total scores on the Health Diagnosis Test was non-significant (.12), much 
weaker than the correlation of .44 between MAT Reading and the Nutrition and 
Disease Test, a more conventional academic measure of facts and concepts. The 
non-significant correlation suggests no relationship between traditional academic 
measures (such as a standardized achievement test) and the problem solving skills 
measured by the Diagnosis Test. 



In addition trj the generally superior performance by the simulation students on 
the Diagnosis Test, there is some indication of a moderately strong relationship 
between their scores on this test and their understanding of the simulation. At the 
end of the third profile, the examiner asked each student to state his or her reasons 
for making the first, second, and third changes on the profile. In other words, the 
examiner asked why were the changes made in the order specified by the student. 
Responses to this question were categorized as 1) the student guessed (didn't know, 
didn't care, didn't know why) 2) the student was working on health problems but in 
no apparent order (i.e. no prioritizing strategy was used) or 3) the student worthed on 
the most important health problem first (i.e. some kind of prioritizing strategy was 
used). The correlation between a student's score for this response and his or her 
total test score on the Diagnosis Test was .69. This suggests a moderately strong 
relationship between the strategies that students thought they were using in the test 
and those that they actually used. 

Secondary Analyses: Comparison wi th Non-Handicaoped Hioh School .<^tijdent«; 

In a supplemental analysis, a one way analysis of variance (ANOVA) was used 
to compare the test performance of the conventional and simulation groups with non- 
handicapped students from regular health classes who did not participate in the 
study. The purpose of this quasi-experimental comparison was to extend the post 
test analysis to students of a comparable age group who were also receiving health 
instruction. Again, scores from each section of the Health Ways Nutrition and 
Disease Test and the Health Ways Diagnosis Test were analyzed. These results are 
presented in Table 4. 



Insert Table 4 about here 
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Total score on the Diagnosis Test showed significant differences between the 
groups (F(2,42) = 27.36 < .001). A Tukey post-hoc comparison indicated 

significant differences favoring the handicapped simulation students ir. coiTipaiison 
to regular classroo'" students (a < .01). There was an equally significant difference 
favoring the regular classroom students over the learning disabled students in the 
conventional group (ft < .01). The learning disabled students in the simulation group 
had problem solving skills on the health profiler superior to those of non- 
handicapped students in regular health classes. The non-handicapped students, in 
lUrn, out performed the handicapped students m the conventional group. 

A significant difference also appeared between the groups on ihe reinforced 
items on ihe Nutrition and Disease Test. f{2A2) = 5.35. ft < .01 . Tukey post-hoc 

comparison showed a significant difference between tho special education 
simulation group ar.d the two other groups (u < .05), favoring the handicapped 
students taught by Health Wavs. Differences on the non-reinforced subscale items 
were non-significant. 

Discussion 

The Health Ways simulation was an effective tool in tcachi : material not 
easily taught by traditional means. In this study, we used direct instruction 
techniques in the initial phase of instruction to teach material rewritten from widely 
used health textbooks. The results indicate the use cf computer simulations can 
effectively complement traditional instruction. 

In addition to providing the experimental students with a simulation, we taught 
an explicit strategy that enabled them to be successful (see Rgure 1). The results 
support tha view that a structured approach In simulations, one where learners' 
tactics are specified and guided, does have significant educational affects. 
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The analyses indicated that the students in the simulation group performc 1 at a 
significantly higher level o- "wealth fact and concept items that the simulation 
reviewed. The «5imulation also had a significant effect in developing problem solving 
skills in health. Simulation students were significantly better at prioritizing specific 
health habits, ones that reeded to changed in the game's profile character in order 
to improve his health anJ longevity. 

The superior performance -by the learning disabled <!tudents in the simulation 
group over non-handicapped students from regular health cla ses suggests the 
extent to which explicit strategy instruction can be successful in teaching problem 
solving shils. The two non-handlcapped students who had the highest scores on the 
Diagnosis Test articulated a prioritizing strategy comparable to \f it given by several 
of the special education students. Thus, many of the spedal education students in 
the simulation group showed a conscious awareness of the strategies that they were 
using, as did the two untaught, non-handicapped students, who may have achieved 
their awareness in a more Intuitive manner. 

Both the instruction in basic health concepts and the explicit strategy for the 
simuli^tion were based on instmctional design principles described in the Theory of 
in?tfyct'Qn (Engelmann & Gamine. 1982). Instruction began with models by the 
teacher of both successful and unsuccessful str?iegies that could be used with 
Hga'th Ways. Next, students practiced the strategy over a range of profiles with 
feedbaci< from the researchers. Gradually, the explict reminders or prompts about 
steps in the strategy were removed, and the simulation students wori<ed on Health 
Ways profiles independently. 

The simulation .tself was also designed to foster the acquisition of a strategy. 
The Health Ways simulation was preceded by a tutorial containing three simpler 
versions of the simulation profiles, each one slightly more complex than the 
preceding one. This gradual progression from simple to cc nplex allowed aspects of 
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the overall strategy to be introduced and practiced one at a time. The contribution of 
detailed, explicit strategy instruction {i.e.. the methods described by Engelmann & 
Carnine. 1982; Palincsar & Brown. 1982) has been investigated with handicapped 
students in reading comprehension (Gamine & Kinder. 1985). content area 
instruction in science (Darci-. & Carnine. in press) and logical reasoning (Collins. 
1 984). Some research on computer simulations suggests that when contrary 
procedures are followod (e.g.. informal instruction, particularly where clear strategies 
and corrective feedback are absent), student learning is insijnificant (Waugh. 1986). 

The present results indicate that simulations combined with instruction in 
strategies for successful use of the simulation can contribute to a student's learning 
of both factual information and problem solving skills. " However, the results say 
nothing a jout the use of computer simulations 1 3 "stand alone" activities. Since the 
two treatn ents different in several respects, we cannot isolate a specific variable that 
accounted for the resulto. That 15 learning disabled students exhibited problem 
solving skills and that this was true for only two non-h-ndicapped students 
underscores the potential jf combining instruction and simulations. Future research 
could address the simulation alone by comparing Health Ways to Health Wavs 
accompanied by explicit strategy instruction. These findings would help articulate 
the context in which computer simulation.^ can be of the most benefit to students. A 
separate line of research will continue to evaluate the various components of 
strategy instruction, whether mediated by a teacher or by a computer. 
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Health Ways Simulation Profile 



Today's 
Year Week Day Expected 

50 00 0 65 
Win Power ^ 180 
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Lung cancer 

No current ailments 
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Stress * 35 
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Weight 


20 pounds overweight 


B) 
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Moderate 5 times/week 


Nutrition 


See Sub-menu 
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See Sub-menu 


^ \ 
^/ 


Maintenance Menu 




H) 


Stress Reduction 


Menu 




Help 





S2 



Table 1 



Health Facts on Julie Hill 

Age: 34 

Herftdity! diabetes Current Diseases: liver disease 

1. Weight and diet: 7 pounds overweight 

a. Eats breakfasts 

b. Eats a lot of food with cholesterol 

c. Eats a lot of empty calorie sweets 

d. Eats very little food with sodium 

e. Drinks very few beverages with caffeine 

f. Eats a lot of food with fiber 

g. Eats balanced meals 

2. Tobacco: non smoker 

3. Alcohol: light drinker (3 drinks a week) 

4. Exercise: exercises 5 times each week 

5. Stress: average stress in her life 



Table 2 

Means (M). Medians (Mdn). and Standard Deviations (SD) of Number 
of Total Correct Answers on the Nutrition and Disease Test 
by Ir^tructional Group 

EOSLISSt Maintenance Test 

Instructional Group 

M3an% Mean% 

N M Mda Qonasi H lA mi Sn Qsmssi 

Simulation 15 22.00 21.5 3.72 73.3 15 19.97 20.5 5.08 66.5. 
Conventional 15 ' 7.93 18.5 5.86 59.7 15 15.47 16.5 5.44 51.6 



Table 3 

Summary of 1 -Tests for the Diagnosis Test 
SiniulatiQQ Converrtional 



M SD 


M SD 1 


df 


a 


Total Test Score 27.7 6.2 


12.47 4.9 7.52 


28 


<.001 


Comoonent Problems Solving Skills 


Involved in the Test 






Prioritizing Alone 8.4 4.7 


1.47 1.9 5.27 


28 




Stress Management 4.9 1.8 


1.73 2.3 4.27 


28 


<.001 


Identifying Health 
Problems & MaKing 
Correlated Changes 14.4 2.1 


9.27 2.8 5.66 


28 


<.001 



Table 4 
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Means (M) and Standard Deviations (Sr. fo: Handicappe-^ and Non-Handicapped 

Students on the Two Academic Measures 

U M SD 

Nutrition and Disease Test: 
Total Score 

Mildly Handcapped Students Taught By: 

'"nrulation 15 22.00 3.72 

conventional 15 17.93 5.86 

Non*Hanucapped Students: 15 19.47 4.94 

Nutrition and Disease Test: 
Items Reinforced by 
Health Wavs 

Mildly Handicapped Students Taught By: 

Simulation 15 7.33 1.35 

Conventional . 5.60 2 20 

Non-Handicapped Students: 15 5.53 1.46 

Health Ways Diagnosis Test: 
Tgtal Sccffl 

Mildly Handicapped Students: 

Sir.iulation 15 27.73 5.89 

Conventional . 15 12.47 4.88 

Non-Handicapped Students: 15 18.07 6.03 



The Effectiveness of Video^sc bistniction in 
Teaching Fractions to L^^uni^is^'p^ and 
Remedial (fish Schcx>l Students^ 

BERNADETTE KELLY* 

DOUGLAS CARNIhE 

RUSSELL GERSTEN 
and 

BONNIE GROSSEN 
University of Oregon 

This study compares theeffectiveness of a vklcodisc cuTTK 
incorporates prmciples of aistnictional design (indiK^na discrimina* 
tk>n practkre cumulative review) 

designed to teach bittic fractions skiy#> T wen ty eig h t high school 
students, including 17 mihiy handicapiied shidcms» quaKfied for the 
St udy by showing (a) mastery of whole number opetations and (b) less 
than SO percent mastery of the fractipnt sldb to be taiight. The 
students were matched in pairs based on a pretest score and math 
scores from the California Achievemem Test, and ihen randomly 
assigned to one of tlte treatments: During the ten^ intervention, 
observers collected data on leveb of treatment implementation and 
student on task behavior. A criterion-refitrenced ppsttest and two- 
week maintenance test were administeriRL The videodiac curricuium 
resulted in sigiifk^ndy higher posttest and inaintehance test scores. 
Levels of on task bdiavior were stgnificiMMV high^ 
sessions, althou^kveis in botiiconditionswere above 80 percent. 
An analysis of stud nt error patterns indicated that differences in 
instf uctk>nal design «eatures contrftxited to the rdative effectiveness 
of the two curricula. 



The National Assessment of Educatk)nal Proyress rer^orted that, natk>nally, "performonce < A 
fractions computation is k>w, and studenU seem to h»ft done their computation with littU* 
understanding" (Lindquist, Cdrpenter.Siiver& Matthews, 1903, p. 16). For example, the assess 
n^nt found that only one third of U.S. seventh-graders can add 1/2 and 1/3. The problem is even 
more pronounced for handk^pped students. 



^Requests fc>r reprints should be addressed to the first author. 

2The research was supported in part by Granl No. G00840066O02 from the U.S. Department ni hdm *tut .n 
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VklMdisc Instruction 
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Research on cHective instructive practices with special education students gjves «.m.; 
.nd,catTn5 how .o ^prove instruction. En^ (1984) measured mikUy hand^capprf s.u^^^^^^ 
orowron^^ o( basK skUk measures and correbted this growth wrth observ«l teac h,^ 
SmZ^ l^re effect«,e teachers (classified on the basis o^ high St"*"'* '^f 

'^''Iw^^^iroved teacher trainingandimpro«ed teacher presentation techniques inayj«.t 
be JZn^Sum itself is being caMed into question. Ten y««^^ » «P°'l!'7, ' 
NaScoun^iliTeachersoei^thematics(Carpem 

,^^1^^^^the waw fractions were tau*t in cajwentional curricula. More recently. i\h 
S;:2SS.^S.^ES^S^dedarS^ 

,eachVSta;:toS«eds carSi in*«ti9alion. This study compares two cumcula deigned .<> 
StoskfrXS-atraditionalbasalcurriculum.andan^ 

& cSSne's (1982) theory <A instructional design. The basal program was selected 

S^^ts «hth^iation pagesspedfirf r*i*»aiiv;^oA^andx^ 

l.?n ha\^ strate(ies It also includes practice with concrete objects (mampulatives). a feature 

«romn^C^ nJa^ educator^e believe it represent, one of the better basal texts 

^"^''^I^' Mastering Frocfions program (Systems Impact. 19K) has been developed from o 
resea^hbS^ryofinstnKtion(Engelmami&Carnine.l98^^ 

nrSes dM^s3ional design. It uses interactive videodisc to replace conventional t.x.. 

leth* t^JSSrto^' des«n /eomrS incorporated into the videodisc program can mcr.-.s.- 
rtudent i^rfc^Im^e It a^ued that the videodisc medium may contnbute to mcnMM-d 

instructional technotogiw arc rwjrevehK^^^ 
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of delivery, as thecurriculum design seemed to bea major variable in determining the effer tivtfnt'bs 
of an instructiortal program. 

ConpMiwm BcliMm the VidMdtoc atid BMd Cttiricuh 

In order to elucidate the principles of instructionai design incorporated m the experimented 
curriculum, this section compares theMosfavigFWicljonsprogranid^ basal program used in 
the current investigation. The section covers the folowing curricdum dimensions: review pro 
cedures, discnmination practice, example selection, and explicit strategy teaching. 

Review Proccdwm 

In the basal program, a skill is introduced and practiced but then "disappears" for seveml <l.ivs 
For example, Mathma&is Today teaches multipiication of fractions in one lesson. In suteequi nt 
lessons, other skills are introduced, including muUiplication of whole numbers and fractions, uikI 
multiplication of mixed numbers. However, in the next three lessons students work with word 
problems, reciprocals and division, after which students are expected to perform the multipk .uu m 
of fractions independently on review and test lessons. 

In MastmnQ Fractions, the skill of multipiyins fractions is introduced and then praciK t d i »n 
even; subsequent lesson in theprogjram. Each new skill that is taught is reviewed cumulativdv. • .r 
else incorporated into mcjre compkfx skills. 

DtscrimifiatkNi Practice 

btudcnis who liwn to carry oui certain steps again and again on the same tyix- ot probU m 
iTwy have difficulties when they encounter differenrprobimi'types mixed together^m ti rcM. \ m 
example, a 14.day unit in the basal program introduces adcSng and subtracting fractions. In ilu- 
next unit, students learn the strategies for nnultipiyingand dMdingfractions. No practice is given i in 
c/iscnminarion between the strateciies te.g.. muhipiicatici^ and addition). In the revievy ird test 
lessons, the problem types are still separated. Students never receive discnmination pr<K ik e 
between strategies. After the 'wo units. fractk>nsoperationsdonot ^«)pear again in the text l<ir ilu- 
remainder of the school year. 

In MostennQ Fractions, a skill is introduced, practiced, and within a few lessons mixed with 
other types of problems. For example, exercises in tHelesson presentatkKi specifically addressi he- 
differences between addition and multipKcatkHi strategics. If students have diffeulty makimi ihe 
discrimination, specific remediatkxi is given, after which students are required to work a set ot 
problems involving both operations. The skills are then integrated with other types of probk^ms on 
every worksheet. 

Darch, Gamine and Gersten (1984) compared the effectiveness of a regular basal nuitlir 
maticscurhculumwithacunriculum program sinwlartoMosteir«fri^ incorpoiuied 
systematic discrimination practkre. Students who received discnminatk>n practice performed 
significa. .ly better than students who dki notion a chterk>n*referencedposttest and mainten«iiue 
test. Englert (1984) also emphasizes the importance of discriminatkm practkre for mikily h«mdi 
capped students, to avoid confuskm between related concepts. 

Example Selection 

Raagt of E»amplM« In the basal program, when students first encounter pkrtures ot 
f rac tk)ns, all examples are less than one. hi the next grade level, mixed numbers are introduced tis . i 
whole number arxl a fraction, reinforcing the misconceptkxn that fractk>ns can only represe nt 
qualities less than one. Improper fractkms do not appear until the next grade level. A common 
error occurs when improper fractk)ns are finally introduced; students represent these fractK)ns cis 
less than one; e.g.. for 5/4 students write: ^^gorv 
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Mostcrmg Fractions teaches students a strategy for reading and writing both proper and 
improper fractions from the begin n ins 61 the'prbgram: 

(a) The denominator telis the number of parts in each unit: 

+®®® 

(b) The numerator teiis the number of parts used or shaded: 

The wide range of examples prevents students firom fc)nnin9 misconceptions and gives students a 
more complete understanding of what a fraction lepicacnts. 

In a carefully controlled experiment, Carnine (1980) demonstrated how a limited range of 
examples can cause students to form misconccptiom. The instructional task v«as to write hun 
dredths fractions asdecimab. One groupolstudertswaspresemedwithawiderangeofexampli'^ 
with numerators of one, two or thnw (tfigits («.g., I^IQO. VIOO, 7S/100). The otKe^ group w.«h 
presented with a limited range of exanu*»rall ra«w«oir»comi?^^ two digits (e.g.. 2iB/100. 
84/ 100, 55/100). Carnine hypothesized that iMudehti in the limited rai^ group would learn the 
misconceptionihatthededmal point isalwaii»plac«ddif«^ 

(i e 4/ 100 = .4, 185/100 = .185). ffe preA:ti«l that tKcs* students would not be able to generalize 
toother examples was verified Studmi»inth«lWt«djrwfli9roupss 
on the problem types K/100. >0OV100o«it^iminedtot«|k»rti€St. Students who had received thi' 
full range of examples stored 89% and 93* rtipKtwdy, • 

Easiy ConfHMd Labels. When hi^ simiar terms (e.g., the terms numerator and dehonv 
inator) are introduced at the same time, thew isan incr a a sa d Itdihood that students will bet imu- 
confused. In the basal program, the tenrn iwmcrator and denominator were introduced in tiu' 
some lesson. In subsequent fraction «iafflptti;tht ta ach iw r ehrred to the terms numeruior .uul 
denominator, and the labels appeared on some worksheets, biit no systematic teaching ensured 
that students could successMy ap|*> the labiii to the fijpropriatftparts of a fraction. 

lntheMasfir»igFrocl»nspro9ram,theteiii»nume«i6ca^ 
several lessons; so that students were faidle with one label before the other, similar label was. 
introduced. Thisproceduredecreasesthe likelihood that students wiBbeccmeconfus^ 

reveraals. 



In the basal pro<yam, students are not always given an explicit strategy to solve a problem. 
This couW lead to student misunderstandingi. Equivalent fractions serve as an example. In the first 
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set of basai exercises* pictures of the two equivalent fractions* and three of the four fractujn 
numbers are given; the students count the number of shaded parts to complete the problem: 
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Students can write the fourth number and complete the equation without understanding equiva 
tent fractions. The students count the shaded parts and write the numerator. In the find set of 
exercises given that day* the pictures are removed. 

3 . ^ 



The student workbook states* "You may draw a picture to help you.** At least some students 
will not be sure how many parts todraw or shade; unless* of course, they already know how to write 
3/4 as ^8. 

In Mos^ering Fractions, the strategy for equivalent fractions emphasizes this rule: when you 
multiply by one you don*t change the vahie. When a fraction is multiplied by a fractk>n equal to one. 
the original fraction is equivalent to the new fraction; i.e.* 

1x1-1 
1x4.4 

^ TT T 

1 m 4 

2 • 

With this conceptual basis for equivalent fractk>ns, students are introduced to the strdt(*(iy (c n 
determining the missing number when given a problem; e.g** 2/3 =?/6* First* students idein.,v 1 1 it- 
fraction of one they multiply 2/3 by* so as to erul up with 6ths in the denominator. The dviioniinaM n 
of the fraction inside the parenthesis is 2. so the fractk>n equal to one is 2/2: 2/3 » (2/2) ^ 6 11 ui^. 
the missing numerator is 4: 2/3 - 4/6. 

i<ameenui, Camine* Darch and Stein (1966) compared a basal approach to uuroduunM 
fractions with a strategy-based approach sin^r to that found in the Mostering FriK tions t urru u 
lum. For the explicit rule-based strategy group, the teacher dismonstrated concepts ond skills in i 
step^by step fashion. Teacher guidance was gradualy and systematicaHy faded until st udent > wet « 
performing independently. Conrection procedures directed students to the explicit instruc luni 
they had received. In contrast* the basal approach was much less structured, Emphasis w<ib eil 
on activities usingstudent discussion and the use of manipulatives. Students in (he explk:it st rute^v 
group perfonmed significantly hi^er on a criterion referenced posttest and on a transfer tesi c »i 
related fractions skills. 
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into the videodisc program '^^.^^V'^g;^^^' ScKtiona! basal program on student 
compared the relative effects ofiJ'^l-^fi^^S^ 
acquisition of skilb in a un,t on fractwns^ O^ro^^ 

Good. 1986) were constant 
ing of the instructional examples. 

w .ei«>«tu«hiahschoolmathclasseswercscreenedfor:(d)mast»;ry..» 
Prior to training, subjects from twoh«hscfto^ 

thepreskiUsnecessajytoteamb^f^c^^ 

the specific skills to be taught. Two '^'^^^I^^SKS 22 students. Eleven w r. 

.n the study. One of the ^SlfSTnit^StSh » *»«dents were ,u.. 

dassiffed as ^arnmg;^^ 12 ninth graders in nml ... 

classified as handicapped. The otto gwerai "^SmtHt-^aie LD students. Students wtu- 

were assigned to «ch ^'^^/^l;^^^ .u, ^^dy and took the posttei«; 26 students took thr 
Out of 34 subjects. only 28compl«J2 for 
maintenancetest.Subj«tattrit«i«^^^ from Asia w,th 

more than 50* of »»*,5?rt2St1S5;S?iSSi^ ^ 
insuffk:iehi language skrfb to benefi^fron^^ 

,enancetesta5j«w^madet«t«^c«rt«M^ 
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A screening test, developed by the experimenter, was adniinistered to ensure that students 
had mastered the requisite whole number skiiis for a unit in fractions (i.e.. facility with bdsic 
addition, subtraction , and multiplication facts). The first part of the test comprised ten of the more 
difficult facts. All students who were tested achieved at least 80% and .vere eligible for the study 
based on this criterion. . \ . ... . . 

The second part of the screening test wascriterion-referenced to the skrtls to be taught m the 
fractions unit. Students who scored above 50% on this part were ineligible for the study. Tvn 
students were exduded based on thi$criterk)n.BigiblesubiKtew^ 
total math scores from the CaUfomia Achievement Test (CAT) and on pretest scores. Individual 
students within each pair were then randomly assigned to the two treatment conditions. The mean 
scores on a 6-item pretest for the videodisc and basal groups were Z.4^(SD = 1.16) and 2.1 
(SD = .90). respectively. There were no signific^t dKftrtnces between the groups. Mean scali-d 
scores (expanded standard scores) on theCAT wereSl I for the videodisc studentsand 504 for thi* 
basal students. Standard deviations were 56.25 and 60.6, respectively Again, these differences 
were not significant. The mean scale scores corresponded to percentile score equivalents of 29ih 
percentile and 26th percentile respectively. (CAT scores were rtot available for all students; N= 10 
for the IV group. N = 14 for the 3T group.) 

Measures oS Achievement 

The prin(^ipal measure for the study was a criterion-referenced test {CKl) developed by the 
experimenter. Two parallel forms were developed asa posttest and a two-week maintenanci' icst 
The test included the following skiiis. taught in both the IV and BT conditions: writing fractK)ns 
from pictures, vocabulary (e.g.. denominator), addition and subtractmn of fractwns with like 
denominators, multiplication of fractkwis, and multiplication of a fraction and a whole numbt-r. 

RdtabiBty and Item Analysis.Fiekl test versions oltheCRT were given tothirty fourth .)nd 
fifth-graders who had had some fractions instructkMi. Internal consistency reliability was assi'ssfii 
for each form; coefficient alpha reliability was .98 for post and .98 for maintenance. Allerna;*' loi m 
reliability was also evaludted; the Pearson correlatkm coefficient between the two fornis w.is «H> 

Measures of ClaasroomVariaUte . , . 

Two classroom variables associated with higher student achievement are (a) total time 
students are octwe^ engagid in instructional activities; time 'on teak' (Rosenshine. 1983) and 
(b) student success rate whUe doing independent scatwork (Fisher ct al.. 1980). 

ActivtEaMMMMrt. An observatbnal recording form was designed to measure the extent 
lo which students v«re actively engag8dduringinstructibn;Each9rdupofs 
either three or four times during the study. Student behaviors were recorded with a six-second 
momentary time sampling procedure. 'On-taslt'bawwiors included answero^g questions, writing, 
and watching the teacher or the monitor during the les«xi presentatton. Behaviors recorded as 
•off-task" included gazing out of the window, sMns. or chatting to another student. Otiu'r 
behavfors (e.g., passing out papers, waiting for t«adw«MistaiKe> weft record^ as transitiondl 
activities. Observersrecordeda»on.taskbdwiprsasplus(+).alo«.taritb^^ 
and aU transitional activities as a xeio. Thus it was pbssMt to determine the proportion of 
instructfonal time spent in each ol the three behavior cat^iortts for each of the two coitions. 

SvccMsKMa itudems' independent scatwork wascdMted^t the endof four observation 
lessons. The percent of proWemsattemptedand the percent that were successfully comptetedw 

cakulated. 
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Measures of ImptenxntaHon , , . 

Imolemcntation checklists were used to kientify those etements ai the teaching models iImi 
were consistently implemented and those implemented at tower ksvels. The checklists were simiU.r 
to the form developed by C5ood. Grouws & Ebmeier (1983). AU items on the checklists wt.. 
operationally defined. Below are two sample items that were applk:able to both instruciion..! 
models. 



YES 


NO 


NA 















i) Did the teacher award points for independent ^work done on 
the previous day? 

II) l)\d the icdchur circulule during independent \work reinforc- 
ing jppropridle behavior? 

Items relating specifically to the IV model (e.g.. whether the teacher checked student ,H'r». .1 
mance at the sp^ified points in the k«oo. or administered a daJy review quiz) were devdo k .I 
JSngthe vkleodS: teacher's gukie. Itemsiipplicableonly to the basal text method (e.g.. wlu-i u-. 
rhT?eacherprolkled an opportunity to use manipulatives. or whether «he te<K;her supplK < 
examples in Sidition to those presented in the text) were devetoped using the basa text U'a. »• 
pJSentation book. Each item scored in Jhe 'yes* category by tlvE observer was tallied, and tlu- 
percent of total checks possible was cakulated for each lesson obsen/ed. 

'^TTesl^.^lt^dJXed. based on the work of Fennema and Sherman (1976, 
Students were a?kS thS opintoSon a S-point scale in response to a series of statements 1 .... 
related tortudents^^^^^ 
instance, 

1 . 1 think I could handle more diffk:»jlt fracttons. 

2 Learning fracttons is a waste of time. 
Items were read to students one at a time and the questton asked. "Is this true for you? ' Siudt-nis 
responded to each item with: Yes. No or Don't Know. 

Materiab 

*"**T£'ratTlSl^ired for imptementatton of the IV fracttons curriculum were: a vldeod.s< 
Diduer the vkieodiscs. consumable student worksheets and teacher answer keys. 

■_n«uw* »L^_«.ii:.»:«i.L;ll«;i»»M«*i«'«4intKpnrt»WK>usli'h 



ivo."I^nwrabriefquizcoveringthe^^ 
Lrin pre2Sa*kI> foltovid next-an explanatton foMowed by written probtems k,r oi 
SS-al skS Aft« completing the lesson, students were assigned independent proWt-im 1... 
S^SiSrk T^eC«?ksheT^^^ 

Icarn^JhusJar currtoulum. every fifth lesson was a test. T.-achors iis.d iIh- 

tests io de!Zine^whether a review of particular s^^^ 
preceding the test lesson. 



^e Siterials required for implementatton of the BT fractions curnculum were: a tm h.T 
pre JntatS^ took («.^th afiswer keys), student textbooks and c?~S»Weworksheets. In som« 
lessons, manipulatives were also used. e.g.. paper stnps or fraction pie models. 
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\ 

Usmh FoTMt Each 30-minute lesson wa^designcd to teach a singie objective. Each lesson 
began with an introduction, in which the teacher ^ised discussion and denionstrations to develof) 
ideas. Next, the teacher guided students throu^ several examples in the student textbook before 
assigning in<lass problems. After completing the lesson, fottowHip activities, usually involvimj 
nrvanipulatives, were used to consolidate the concept developed in the tesson. Students were then 
assigned independent problems for seat work* The worksheets comprised 20-40 items focusiny on 
the student objective introduced that day. 

Review and Unit Tcttt, Review tests were provided at the end of the unit, sampling each of 
the major skills and concepts that had been introduced. Teachers used the results of ^he revi(*w 
test to reteach concepts and skills that students had not noastered. The unit test was pre^nted the 
next day. The review and unit tests sampled the same skills in the same order, and hod <i 
standardized lest format. 



ProccduTM 

Teachers were the experimenter and a research assistant from the University of Oreyoii. 
Each teacher taught one condition for one-half of the study, then changed conditions for the 
remainder of the study, to minimize teacher-student effects. 

Monitoring Impkmentation 

1 he teachers were observed onfour.occasions to assess the level of implementation rn h 
classroom. Teachers received specifk: feedback on their performarure, using the Implementutiur. 
Checklist (discussed under McnaurM). Throughout the study, teachers discussed any problems 
associated with the implementdtk>n of the two approaches. 

Observers 

Two trained observers recorded students* time on*task and percent correct responses nn 
independent worksheets, on three or four occasions foir each group of students. Before colkn 
the experimental data, the observer > practiced using the instruments until inter-observer rehabilti v 
exceeded 85 percent. 

Adminietration of Mcmutm 

Criterk)n -referenced tests were administered to all students participating in the study immc* 
diately following the completion of the unit (posttest), and two weeks after completk>n of the una 
(mairuenance test). 

Students* on task behavior and success rate, and levels of implementation were measured on 
the second, fourth, seventh and ninth days of the interventk>n. The experimenter conducted 
student attitude surveys before and after completk>n of the study. 



RnMlto 

The primary dependent variable was student performance on the 12 itemcriterK)n refen-ru rd 
test (post and maintenance). A 2 2 ana^^sis of variance (ANOVA) was perfornoed on he C'K I 
scores. The between-subiects factor was the instructional method (videodisc vs. basal text); the 
within subjects (repeated) factor was the time of test (post and maintenance). Signifkont inuin 
effects were found for the instructk>nal method [F* n2B,p< .001) and for the time of test (/* 
4.53, p < .05). Figure 1 graphk:ally depicts the mean scores for students in each condition on the 
posttest and maintenance test. Tabk* 2 presents the descnptive statistks for the cDinixirisoti 
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Students in the videodisc and basal conditions were on'task 96% and 84% respectively of the 
total instructional time duringobaefvatton periods. A Mam Whitney U Test indicated a significant 
difference between the Iwo condMons (U«3.5,p< .006). Students* performance on independi-nt 
seatwork was 96% correct for the BT gRNip and 91% correct for the IV group. 

Levels of implementation were extremely high In both conditions; 93% of the pebble impl*- 
mentation behaviors were observed kt the fiT condition, and 92% !n the IV condition. Jhv iH'm ili.ti 
was weakest in both conditkMis referred to the teacher awarding points lor the prevkHis day's 
independent work. 
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Responses from the student questkmnaircs were summariied and assigned a score ranoiiiM 

from -1 (aU negative responses) to +1 (aN positive responses) for the students' perception ..i 

(a) their competence in working with fractfona. and (b) the relevance of fractwns for daily ifc 

Students in both condittons made simitar smw in perceived competence and retevance. I h.- 

results are summarized in Table 3. ^ 
An analysis of covariance was performed, wid) pretest scores serving as the covanate. There 

were no siyiificant differences between the two inelnictfonal groups for either type of questKin 

(ability or relevance). This inckides no signlicant dMeience in the change in attHude between the 

two groups. Both groups showed growth in perceivwl compWence. This parallels the observed 

growth in skia as measured on the criterion-referenced test. Only trivial increase in perceivml 

relevance were found. 
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TakbS 



Type ol Question Prt< Pbtf Typtof Qmstlon Pft^ Pom- 

Competence Rdtvance 

IV .10 .81 IV .48 S7 

BT -21 .54 BT .15 

**Scores range from - 1 (very ncgitivt) to (very poeitive). 



Ditcii88ioii 

The resultsof the experiment suggest that Iht differtnt instructkmaldesign fed ures in flu* tw( > 
curricula produced different levels of student mastery of the content covered. The students 
receiving the videodisc curriculum scored significanHy hi|^, both on the cnlerion refcrftut ci 
PQsttest and on the mamtenanct test. The videodisc scores also dropped less dramatically tmn 
time— a nonsignificant dro; of 1% compared to a drop of 71^ for the basiri text students. 

While a significant difference was found between the two condLons for students* on Msk 
behaviors, it should be noted that levelsof qn-taskwerehiiMn both co nd M on s . Students receivinq 
the basal lessons were weR motivated and actively involved during the lessons, ^imilarb^, st udc nt 
success rates on independent worksheets vvere very hi^iinbothcondit^^ 
for the basal students and videodisc students* respectivsiy* These findings imply that the instna* 
tK>nalquality of the tVcurhculum(rather than other process variabies)wasi^ for 
the differences in student performance. 

Pattenisof student errors aisoconlirmJheimpoitanceof the specific differences b^ 
programs. For example, a large proportion (7S%) of students in the basal treatment made errors 
when asked to write the fraction for a diagram representhg a fraction greater than one. Given the 
diagram 




•B 



56% of the basal students wrote 5/6, even though aH students couM identify 



©■B 

as 1/2. The inability of 75% basal text students to ex l rqpotoie fo fractfons greater than chm* is «i 
predictable consequence of al examples less than or equal to one during the treatment 
interventfon. In contrast, only 8% of the videodtoc studeMe» who had brrn exposed to frac!k)ns 
greater than one, exhibited thiseriof on the postteet> This parslsk the resute 
( 1960) cited earlier> For farther discussion on the relationship between student error patterns and 
instructkmal design, see Kely <1966). 
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dfective mathematics mstru2«n,Thec^^ 
ticoal design features, together with its co«eHectivene8s,o«nwn.w« i~ 

instructional tool. 
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Quite commonly the differences between novices and experts are attributed to 
general intelligence, superior problam solving skills, or imagination (Larkin, 
McDermott. Simon & Simon, 1980). A closer examination of experts, however, 
reveals something else. Recent work in cognitive science has underscored the 
importance of subject area knowledge as a main factor that distinguishes novices 
from experts. Chi (1978), Chase and Simon (1973). Jeffries. Tumer, Poison, and 
Atwood (1981) have conducted several interesting studies which indicate differences 
in these knowledge levels are mainly due to substantial practice, a thorough ' 
familiarity with the subject area, and a fadie use of reliable strategies for solving 
problems in the particular subject area. Experts can rapidly retrieve items relevant . 
to the problem at hand (Chase & Simon, 1973). As a result, experts solve problems 
considerably faster and more accurately than novices. These studies show that the 
primary difference is not innate ability, but expert's superior use of content specific 
facts, concepts, and problem solving strategies - a state arising from instruction and 
extensive practice or structured experience. 

This research has considerable implications for special education, particulariy 
for mildly handicapped secondary students who need to learn more than just basic 
skills. Some educators (e.g., Goodlad. 1983) note that these students, when 
mainstreamed, are expected to graduate from high school with their non- 
handicapped peers. Yet secondary special education instruction often amounts to 
simple drill on elementary facts and concepts. Although automaticity in math facts, 
foi' example, is important, these students also need to learn infonnation that is more 
in-keeping with what is taught at the secondary level. If these students are to master 
many of the same basic requirements as their non-handicapped peers, as some 
educators have suggested (e.g.. Meyen. Alley. Scannell. Hamden, & Miller. 1982), 
then they need to be taught complex concepts and problem solving strategies as 
well as elementary facts. 
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This view is not intended to imply that mildly handicapped students will or 
should become experts in a subject area. Rather, mastery or competence in specific 
knowledge ~ a less thorough, but adequate understanding of the material ~ is a 
more reasonable goal, in essence, students need to be taught how think more 
"effectively" about a subject area. Students should ler^rii core facts, concepts, and 
rules at an automatic level and then use explidtty taught strategies to solve a range 
of challenging problems. 

Destnnin? (nstnjctlon for the Secondary Level Through CM 

Teaching these skills involves well designed instruction within a specific content 
area. Such instruction shouldlhorougitly describe the diffensnt stages that lead the 
mildly handicapped student from a novice state to a level of competence. A 
description of this process, which is always gulaed by the content as well as by 
instructional design principles, shoukl detail how best to organize different kinds of 
knowledge in a content area. This description shouki also include the 
interrelationship between different kinds of knowledge (e.g.. how knowledge of basic 
facts and concepts relate to problem solving strs^egies) and the optimal means for 
teaching each kind of knowledge. Some instructional designers (Case & Bereiter, 
1984; Engelmann & Camine, 1982) have already begun to articulate the many steps 
that move a student toward competence in a subje<.1 area 

For the last two years we have been studying the effects of CAI on mildly 
handicapped secondary students. CAI was chosen because of its increasing 
popularity in the schools and more importantly, because it is useful medium for 
embedding and then testing many empirically based instructional design principles. 
Our research has been conducted in three different areas: fact instruction In 
vocabulary, concept instruction in slementary logic, and problem solving in health 
education. Each of these approaches represents a different level of teaching in the 
^ novice to "competence" continuum. 
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As curricular approaches, the programs are designed to be tutorial or 
compensatory (Deshler. Schumaker. Lenz & Ellis.1984). That is. the programs are 
designed to either teach the student secondary level information in the quickest, 
most effective means possible (e.g.. vocabulary instruction) or the curriculum is 
altered in such a fashion that students are taught comparable information in a 
different way. Technology, when used selectively, can assist this process by 
relieving teachers of time consuming, relatively low level teaching and. in some 
cases, BS a way of conveying information not easily presented by conventional 
means. 

In our experimental studies, the major focus in the data analysis has been the 
comparative performance of handicapped students randomly assigned to various 
experimental conditions. However, we have also tested samples of non- 
handicapped secondary students on the criterion referenced post test measures 
used in th« study. This quasi-experimental procedure was used in order to gauge 
the progress of our experimental students in becoming competent in a content area. 
Our goal was for the subjects in the experimental treatment to a) surpass mildly 
handicapped subjects in the comparison group and b) perform at a level similar to 

non-;iandicapped student-, jn the skill. 

The items contained in our measures are well within the range of typical 

instmction at the secondary level. Where in some cases non-handlcapped students 

may not have been directly taught the exact content prior to test administration (e.g.. 
• a particular vocabulary word or how to derive a conclusion from two premises), it is 
- not unreasonable to expect that many of these students could have gained this 

knowledge on their own. Perfomiance levels of non-handicapped students were 
' significantly above a chance level of responding, enough to justify this assumption. 
Insofar as the differences between the experimental groups and their non- 

handicapped peers diminishes after instruction, we are better able to understand the 
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combined effects of instructional design principles and technology on the acquisition 
of domain specific knowledge. Furthermore, we can judge the extent to which mildly 
handicapped students are meeting basic academic requirements. What follows is a 
description of each program, *he results of our quasi-experimental analyses with 
normal high school students, and the implications for knowledge development within 
a specific content area. 

Teaching Vocabulary! Instmcfion at the Fact Lave! 
Teaching vocabulary is regarded as important instmctional activity, particularly 
as word knovJedge is highly conflated with reading-comprehension skills, in light of 
this, many researchers (Anderson & Freebody, 1981 ; Pearson & Gallagher, 1983; 
Tierney & Cunninghar.1,1984} have looked for effective ways to teach vocabulary. 
Unfortunately, those methods which were most successful also required a 
considerable amount of instmctional time. For example, a study by Beck, Perfetti, 
and McKeown (1982) attempted to teach only 104 words in 75 thirty-minute lessons. 
At the end of the study, students knew an average of 85 words that they did not know 
prior to the program, but this took 2,250 minutes of instmction or approximately 26 
minutes per word. This amount of 'tme is considerably more than that typically 
devoted to vocabulary instruction in the middle grades (Durkin, 1979; Roser & Juel, 
1981). 

Computer assisted instruction. It would appear, offers the advantage of 
increasing instmctional time on such a low level task without pladng increased 
demands on a teacher's already limited time. Students shoukl be able to master the 
words more effectively if they are given extra practice on difficult words and 
cumulative review throughout the program. In the end, it is hoped that this 
knowledge will be used later in reading and writing activities. 

We compared two methods of computer assisted instmction (CAI) for teaching 



vocabulary to mildly handicapped secondary students (Johnson, Gamine, & Gersten, 
1986). The study examined the Impact of two instructional design variables: the 
effect of size of the teaching sets and provisions for daily and cumulative review on 
the acquisition and maintenance of word meaning. Two CAI vocabulary programs 
were used to present the same 50 words and definitions. 

The experimental software program used in the study, the Small Teaching Set 
program (Gamine, Rankin, & Granzin, 1984), begins by testing students on a set of 
50 words. Lessons are created using only words student could not define on the 
pretest. The program then provides instmction on a "teaching set" of no more than 
three new words. After initial instruction, these words are then added-to a "practice - 
set" consisting of a maximum of seven words. The student must meet a specific 
mastery criterion on each word (i.e., two consecutively correct responses in each of 
two lessons) before It is removed from the practice set. Once the student has 
mastered ten words, the program tests the students on these words. Missed words 
are placed in the practice set and retaught. Rgure 1 is a visual representation of the 
practice and review schedules embedded in the program. The figure shows how a 
word moves from an initial test item through a practice set to a final cumulative 
review lesson. 

[Insert Rgure 1 about here] 
The comparison program, the Large Teaching Set program, teaches words in 
sets of 25 words (Davidson & Eckert, 1983). The student may choose to learn the 
words in any of four types of fonnats: (a) a teaching display which shows the word, 
its definition, and one example sentence; (b) a multiple choice quiz format; (c) an 
exercise in which a definition is displayed and the student must spell the correct 
missing word to complete a sentence; and (d) an arcade-type game in which the 
student matches words to their definitions. No cumi'lative review fomiat was 
incorporated into the program. 



Comparative Pftrfnrmflnce of Milrilv Ha ndicapped SttiriBnts 

Twenty-four mildly handicapped high school students were randomly assigned 
to one of the two CAI programs. The same set of 50 words were used in both 
programs. Students worked individually on their assigned program 20 minutes a 
day until they reached mastery. These words, 25 verbs and 25 adjectives, were 
considered important by two secondary special education teachers. 

P\\ students were given a 50 item, multiple choice test on the definitions as 
soon as they achieved mastery (i.e., 90 percent). Ten of the twelve subjects (83%) in 
the Small Teaching Set program met mastery criterion by the end of 1 1 sessions, 
while this was true for only eight of the twelve subjects (67%) irr the Large Teaching- - 
Set program. The study was terminated after 1 1 sessions because the remaining 
subjects were experiencing fnjstrations. The mean number of sessions to mastery 
(for those who reached mastery) was 7.6 for those In the Small Teaching Set and 9.1 
in the Large Teaching Set program. Results of a 1-test indicate this difference is 
significant (s, < .05). Hence, subjects in the Small Teaching Set program mastered 
the 50 words in significantly less time. In addition, more students in the Small 
Teaching Set program reached mastery within 1 1 lessons. Given that the groups 
achieved equivalent levels of performance on the multiple-choice tests, their 
difference in acquisition rates becomes even more meaningful. Subjects taught with 
the Small Teaching Set program required less time to meet mastery criterion on the 
words, yet their posttest performance was equal to that of subjects in the other 
treatment who took longer reaching mastery. 

Comparispn of Mildiv Handicapped with Non-Handicapped Studen»g 

The same 50-item multiple choice vocabulary test was administered to a 
sample of 30 non-handicapped 10th-grade students in a regular English class. As 
Table 1 demonstrates, the posttest mean scores of the mildly handicapped subjects 
were slightly higher than the non-handicapped students' mean score. After a 



maximum of 1 1 sessions of computer- assisted vocabulaiy instruction, the 



performance of mildiy handicapped subjects on the multiple choice test was very 
similar to that of non-handicapped 10th grade students. 



implications for Software Design 

Two issues arise from this study. Rrst, the size of the teaching set and 
schedules for review led to an significant difference in learning rates between the 
two handicapped groups. Examph set size and review schedules are comparatively 
subtle instructional design principles, yet they are essential for tasks where a 
considerable amount o.^ practice and memorization are required. 

Second, a minor finding in the study had to da with the arcade-type game 
contained in Large Teaching Set program. During the stuJy, some of the Smail 
Teaching Set students occasionally asked the experimenter why they didn't get to 
play a game like the one in the other program. However, after the study, students in 
the Small Teaching Set were asked what they specifically did not like about the 
program. Not one subject mentioned the lack of a computer game format. 

This finding, though preliminary, is important for CAI software designers who 
apparently believe that for educational software to be motivating, it must approximate 
computer games that are popular in vkieo arcades. Focusing on these kinds of 
surface features ~ rather than the instructional design considerations relevant to the 
task ~ may very welt lead to software progiams that are insufficiently structured for 
success. 



An understanding of elementary reasoning and logic typically precedes a 
student's further training in analytic thinking. Once a student has a firm grounding in 
basic reasoning skills, teachers are in a better position to show students how to spot 



[Insert Table 1 about here] 
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faulty arguments, identify false conclusions, and detect unwarranted generalizations. 
Zetlin and Bilsky (1980), however, suggest that educators create a SQif-fulfilling 
prophecy by not routinely teaching reasoning skills to special education students. 
These students consistently perform poorly on logics* problem solvirtg tasks (Spitz & 
Borys, 1977} and as a consequence, teachers often believe that these students 
cannot be taught reasoning skills. 

The Reasoning Skills program (Engelmann, Gamine, & Collins, 1983) was 
designed to teach students to: a) to draw conclusions from two statements of 
evidence and b) to determine whether a three-statement argument was logical or 
illogical. The program taught students the three pcssible-key words Ysome. all, na) • 
that can begin any statement in an argument; their relationship to inclusiva, 
overlapping and non-overlapping classes; and relevant rules for constructing 
arguments. SlJdents were also taught to identify unsound arguments by citing one 
of three reasons (e.g., inappropriate key word in the conclusion, the appropriate 
class size is not named in the conclusion). 

The major strength of the Reasoning Skills program Is the teaching of an 
explicit, step-by-step strategy based on a series of carefully controlled rules. Rgure 
2a represents the skill of drawing a conclusion from two statement of evidence. 
This requires the student to first read the evidence statements and check for key 
words that begin each statement. On this basis, the studem is able to use a set of 
rules to first determine the key word in the conclusion and next, to complete the rest 
of the conclusion based on an examination of the classes In the evidence. 

[Insert Rgure 2i about here] 

Figure 2b portrays a more somplex task: critiquing an argument. A student must 
read both the evidence and conclusion and determine if the conclusion follows from 
the two evidence statements. To critique our argument, the student must consider 
more features than when constructing a conclusion (e.g.. Implications of the key word 
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in the conclusion for class membership and order in the evidence statements). As in 
the previous task, the student must look at key words and classes. Howjver, he or 
she must now make this evaluation by using a set criteria (i.6., the multiple choice 
items) that force the student to apply all previously learned knowledge about 
arguments. 

[Insert Rgure 2t about here] 
The advantage of the Reasoning Skills program over more traditional 
introductions to elementary logic is that the program teaches concepts with a 
minimum of verbage. Concepts such as major and minor premises, middle terms, 
distribution of terms, and subject ind predicate distinctions are avoided. Even - ^ 
further. th3 reflexive relationship between the statements of evidence (i.e., their order 
or position can be interchanged with no effect on the conclusion) are demonstrated 
in the program rather than the typical method where the major premise is 
conventionally written first (Black, 1952). For example, consider the argun t... 

All French preskients are bakl. 

Some socialists ara French prasidants. 

Some socialists are bald. 

It would be common for the major premise (All French presidents are bald) to appear 
first, even though this is unnecessary. It is likely that students, particularly mildly 
handicapped students,'who continually see only this kind of ordering will have 
difficulty drawing conclusions when the statements of evidence are reversed. 
Comparativfl Performance of the Mildiv HandlcatiDed Studants 

The main interest in our study was to examine the effects of different correction 
procedures on two groups of remedial and mildly handicapped students (Collins. 
Camine & Qersten, in press). Thirty-four students were randomly assigned to one 
of two groups: the Basic Correction or the Elaborated Correction group. When 
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students in the Basic Correction group made an error, they were told the response 
was incorrect and were provided the correct answer. When a student in the 
Elaborated Correction group made a mistake, he or she was immediately corrected 
and an explanation related the explicit strategy that the student had learned earlier. 
This was the only difference between the two conditions. In both conditions, students 
worked individually on a microcomputer. Students worked on their respective 
version of the program until they completed five lessons. 

Student learning was measured on a criterion referenced test. The data 
analysis indicated a significant difference favoring the Elaborated Corrections group 
(Q < 001 ) on both the immediate posttest and a makitenanca test administered twc*.'- ■ 
weeks later. There was also a significant difference between the two ^roup$ on the 
transfer test, again favoring the Elaborated Correction group (a < .05). The transfer 
test used arguments embedded in prose passages. 

The two groups took roughly an equivalent amount o> Jme to complete the f>ve 
lessons, indicating the extra time required to read the elaborated corrections may have 
been compensated for by faster acquisition of the matedal. This Interpretation 
suggests that taking more time early in a complex instructional sequence to offer 
elaborated con^ctions may, in fact, lead to savings in instructional time later in a 
program. 

Comoaratlva Performancft of tha Mildly Handicanped Rtudants 

Following this study, the program was revised and presented to another sample 
of mildly handicapped secondary students. The Test of Formal Logic was also 
administered to three non-handicapped groups: a tenth grade honors class, a 
college level \og\c class, and college level education students. Table 2 shows the 
ANOVA results Part 1 of the Formal Logic Test. This section of the test measured the 
"tudents* ability to identify the key word in the conclusion and wrfte the remainder of 
the conclusion based on two evidence statements. Tukey post hoc comparisons 
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showed only one significant difference between the fii'st three groups (I.e., the 
instructed handicapped students, the honors class, the logic c'ass) and the 
education students (a'< .05). The education students from the university scored 
significantly lower than the instructed handicapped students and the other two 
groups. 

[insert Table 2 about here] 
Table 3 shows ANOVA results Part ii of the test. This section requires students 
to determine whether or not an argument is faulty and if so, select a reason. A Tukey 
post hoc comparison showed a significant difference between logic class and the 
other three groups (I.e.. the handicapped, honors class, and education students) (&< 
.05). There were nonsignificant differences between the last three groups. This 
finding indicates that on sophisticated reasoning skills, only the unlvereity logic 
students are competent. In contrast, Q^ the easier rea«>oning skills, all the groups are 
comparable except for the lower performing university education students. Most 
important, there were non-significant differences between the university togic class 
and the high school handicapped students on the easier reasoning skills. 

[Insert Table 3 about here] 
Implications for Software Design 

As previously descnbed, the Reasoning Skills program contains several 
instructional design features that altow the student to achieve competence in a 
complex area of knowledge. Most important Is a generaHzable strategy that applied 
to all arguments except ones containing double negatives. Once we devised an 
overall strategy, the program was divkied into distinct components. Necessary skills 
for each component were pretaught. For example, students were taught how to 
evaluate evklenca statements to see If they were appropriate before they applied 
rules for using key words in determining the logical soundness of the conclusion. 

Each component was chained to the next one. As the leamer moved from one 
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component to the next, prompts were faded. That is, added instructional elements - 
ones that would trigger the execution of a compos nt - were gradually removed. 
For example, once a student learned how to evaluate the appropriateness of 
evidence statements they were prompted to look bt the first word h the conclusion. 
Eventually, this prompt was laded and instruction focused instead on the two classes 
in the conclusion that followed the key word, introduction of new components and 
prompt fading continued until all types of arguments (i.e.. alL SOIDSL Dfi) were 
gradually integrated. Students were given discrimination practice between the 
different types of arguments for the remaining lessons in the program. 

By minimizing the verbage traditionally-associated witfr the subject and ' * 
concentrating, instead, on class size, the student is able to 'reason' about 
arguments. The program demonstrates that a CAi tutorial can teach these skills 
without added teacher instmction. What is reouired Is a carefu^ preliminary analysis 
of the content by a curriculum designer. The next step, which has not been 
completed yet. i'^ to link the program to further instruction in reasoning and logic (e.g., 
analyzing longer arguments or detecting improper generalizations <n short 
paragraphs). 

Health Knowladiyar Instrufition in Probtem Soivino 
Secondary students spend a considerable anfx}unt of thair time complet'.ig 
application-oriented activities. The*.^ academic tasks often involve higher order 
cognitive skills, and students are asked to make a variety of inferences abc'it a 
subject area by prudently using facts, concepts, and content related strategies or 
problem solving skills. Some writers (Doob. 1972; Qreenblat & Duke. 1975; Budoff. 
Thormant.. & Qras. 1984) have suggested that one way to enhance the higher order 
skills of students is through educational simi^.atlons. In addition, simulations have 
been suggested as a way of increasing me participation of lower achieving and 
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inattentive students ( Farran, 1968; Boocock & Schild, 1968: Stembler, 1975 ). 

One of our interests in studying simulations was to investigate how they could 
be used to enhance ~ rather than replace - secondary level instruction, not only in 
terms of their aftect on basic fact and concept retention, but as they related to 
problem solving. We chose a health simulation because it was designed to foster 
the acquisition of particular strategies. Health Ways was preceded by a tutorial 
containing three simpler versions of the simulation profiles, each one sOghtly mora 
complex than the preceding one. This gradual progression from simple to complex 
allowed aspects of an overall monitoring strategy to be introduced and practiced. 
Health was also a good subject area because it is rich iniacts and concepts. 

Rgure 3 gives a visual representation of the strategies the students needed to 
use to succeed at Health Ways. Students monitor three separate strategies (i.e., 
prioritizing and changing bad habits, checking the stress level, and maintaining 
health changes) through a monitoring or meta-strategy. While playing a Health 
Ways, game, the student first priori^zies and changes a bad health habit, moving 
down through the tree until an appropriate action csn be taken. It there is no cunrent 
disease, he or she next looks at the hereditary diseases. If there is one, a related 
health habit (e.g., eating foods with too much sugar for a person with an hereditary 
history of diabetes) is identified and the student attempts to change the habit through 
the F8 "computer." F8, essentially, simulates fate or chance, it displays four random 
numbers, each between five and twenty-five. Number values are assodated with 
successful changes and the score on F8 determines whether or not the habit can be 
changed. 

[Insert Rgure 3 about here] 
Regardless of the success on the F8 game, the student must return to the upper 
level of the the tree and move to the right to the check-stress- level strategy. Again, 
the student descends in the tree, this time in the check-stress branch, to detemiine 
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the appropriate action. Next, the student returns to the upper level, moves to the 
maintain-health-changes strategy and, if necessary, descends in that branch. The 
process of descending anc traversing the tree (i.e., going back to the far left once the 
rig^t most branch is checked) is repeated until the student succeeds or fails at 
achieving the main goal (i.e., increasing expected age to winning age). 
Comparative Performance of the Mildly Handicanoed Studftnts 

To measure the effects of the simulation, thirty students were randomly 
assigned to either the conventional or simulation condition. Direct instruction 
techniques were used to teach a typical health curriculum to all students for 20 
minutes per day for twelve days. This was the first part of each day's lesson. 

At the end of the initial instruction, students separated into two groups - one' 
which worked on applicaticn activities {Xha conventional group) and the other with 
the computer simulation (the simulation gro.up). The conventional group worked in 
the resource room under the supervision of the resource room teacher, who 
presented these students with a variety of application or review activities. 

Simulation students, on the other hand, were taught in a computer lab, each 
student working individually at a microcomputer. The twelve day course of 
instruction for these students was broken into three phases: initial modeling (three 
days), guided practice on three simulation games (two days), and independent 
practice with individual feedback from the instructor (seven days). During the initial 
modeling phase, students were taught an explicit strategy for using the simulation. 
The effects of appropriate and inappropriate strategies were demonstrated. 
Students were first shown how to prioritize health problems by using infomiation they 
had learned in the direct instruction portion of the lesson. As the researcher 
demonstrated progressively more difficult games or profiles, students were shown 
how to monitor and change two other variables: stress level and maintenance of 
health changes. During the guided practice phase, students were then able to 
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practice different strategies with feedback from the researcher. 

Students were assessed one day. two days, and two weei<s following the 
instruction. On the first day. student's acquisition of basic facts and concepts about 
health taught in the curriculum was measured by the Nutrition and Disease Test. 
The first 20 questions of this test were solely from the written curriculum. The 
remaining 10 covered material that appeared in boih the written cum'cuium and the 
Health Ways simulation. Internal consistency reliability (coefficient aipha) of this 
measure is .84. On the second day, the students were given the Health Diagnosis 
Test, an individually administered test that measured prioritizing skills. This test was 
a set of three written profiles and measured health related problem solving skills, (i.e., 
the student's ability to detect important health problems, facing an individual,.identify 
and change related health habits, and control stress as it increased due to the health 
changes). The Health Diagnosis Test has a test - retest reliability of .81 . Two weeks 
after the instmction the students were again given the Nutrition and Disease Test. 
This served as a retention measure. 

The 30 item Nutrition and Disease test was broken into two subscales: (a) items 
reinforced by the Health Ways simulation, and (b) items taught in the curriculum and 
not reinforced by the simulation. The effect on items reinforced by Health Ways was 
significant (fi < -01 ) and nonsignificant for those items not reinforced (n < .06). This 
indicates that the simulation was an effective vehicle for reviewing material that had 
already been taught in the written cuniculum. 

1-tests performed on the Diagnosis Test demonstrate a significant difference 
between the two groups (p < .001) in problem solving skills. Simulation students 
were better able to diagnose health problems, prioritize them as to their effects on an 
individual's longevity, and prescribe appropriate remedies. 
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ComoariSOn of Ihe Mildiv Handirannflri with Nnn-Hflndicannftri gtiiriftnt<^ 

A one way analysis of variance (ANOVA) compared the test performance of the 
conventional and simulation groups with a random selection of students from regular 
health classes who did not participate in the study. Again, scores froTi each section 
of the Health Ways Nutrition and Disease Test and the Health Ways Diagnosis Test 
were analyzed. Table *i shows a significant difference between the three groups on 
the Diagnosis Test (a < .001). A Tukey post-hoc comparison indicated significant 
differences between the handicapped simulation students and those in the regular 
classroom < .01) is well as a significant difference between the regular 
classroom students and the mildly handicapped students in the conventional group- - 

(a<.oi). 

A significant difference also appeared between the groups on the reinforced 
subscale of the Nutrition and Disease Test (a < .01). Tukey post-hoc comparison 
showed a significant difference between the mildly handicapped simulation group 
and the two other groups (fl < .05), %voriig the handicapped students taught by 
Health Ways, but no difference on items not reinforced. 

We infer from the results that a combination of direct instruction in basic facts 
and concepts with a computer simulation was successful in teaching problem solving 
In a content area. Further, the superior perfomiance by those in the simulation group 
over non-handicapped students from regular health classes shows that this kind of 
problem solving is by no means an automatic by-product of regular high school 
instmction. Instead, teaching competence in health requires a careful orchestration 
and integration of facts, concepts, and strategies. 
Implications for Software r>Qsign 

The success of the Health Ways study was a direct product of a careful analysis 
of simulation interventions. As Rgure 3 indicates, a student must use many skills. In 
order to execute appropriate actions, a student must have a firm grasp of both facts 
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(e.g.. what is cholesterol? What disease Is related to cholesterol?) and strategies 
(e.g., The stress level Is going up and I haven't changed an Important bad habit yet. 
What do I do?). In such a network of Information is is easy for a student to act in 
many ways that lead to serious errors. For mildly handicapped students, the effect of 
this is usually frustration and a failure to leam anything from the simulation. 

This is why an explicit strategy is essential. As with Reasoning Skills, 
components of the strategy are progressively introduced and then chaindd together. 
Here, students first learned about prioritizing and then wer^ prompted to execute 
specific actions under certain conditions (e.g., The character's current disease is 
lung cancer. What related habit should you look at? Does alcohol have anythrng to 
do with lung cancer? Does smoking?). When the next component (i.e.. stress 
management) was introduced, prompts for students for prioritizing were gradually 
faded. The fading, which lasted through the guided practice phase, allowed students 
to maintain a high level of success while learning essential skills. 

Integrating software with ti'aditional curriculum and using an explicit strategy for 
using the simulation had a very significant effect on problem solving ability and 
hence, student competence in the content area. In health, as with many science and 
social studies areas, there are wide range of goals, many of which are discretionary. 
This study shows that both the curriculum and the software can be adapted to meet 
important instructional goals, ones that lead to increased competence in the subject 
matter. 

Finally, linking traditional practices to computer instruction allows for the optimal 
use of each medium. Group instruction is an efficient way of teaching and firming 
basic fact and concept knowledge. It is partlculariy appealing where schools only 
provide enough computers for an entire class in a computer lat). With the high 
demand placed on labs, computer time must be used judiciously. In this study. 
Health Ways was used to teach problem solving skills that could not be easily be 
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demonstrated by conventional means. Thus, computer use was restricted to an area 
where it optimized instruction. 

Conclusions 

The three studies suggest that instructipnai design principles and computer 
technology can effectively wori< together in teaching mildly handicapped students to 
think more effectively about a content area. We believe that successful programs- 
either with or without the use of technology - begin with a careful analysis of how 
best to teach the content. This requires an understanding of how different kinds of 
knowledge in the content area are related as well as how the knowledge can be 
effectively sequenced. From this content anatysiSnanHntogral part of the instructional 
design process, comes the use of empirically based principles for sequencing and 
presenting the material. Finally, we consider whether or not technology is the most 
efficient or optimal means of instruction. 

The three studies reviewed above bear out this curriculum process. The ability 
to define words is fact level knowledge that requires considerable practice. An 
above average amount of practice is required for mikily handicapped students. To 
increase the efficiency of this practice, an optimal example set size and cycles of 
review were employed. Rnally, we used a CAI program incorporating these design 
principles in order to relieve teachers of this time consuming and relatively low level 
task. 

In analyzing elementary reasoning skills (our example of concept teaching), we 
noted that traditional instruction often does not provide explicit, step^}y-step 
strategies and tends to be laden with too many terms and definitions. An elaborated 
con'ection was used because the content was rule based and thus, when students 
erred they were reminded of the procedure for deriving the answer. 

We used a CAI program to test whether or not such knowledge couki be 
adequately taught as a tutorial. Typically, CAI programs merely provide drill and 
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practice exercises to supplement teacher instruction. Here the program was a true 
tutorial- it did all the instructing. 

Finally, our simulation instruction evolved out of an examination of problem 
solving instmction. As with social studies, health is a discretionary content area, 
allowing for various instructional goals. Typical health instruction often treats many 
diseases and bad health habits in an indepth, but undifferentiated fashion. That is, 
students are rarely given the opportunity to comparatively examine and prioritize the 
relative impact of different habits on a particular individual's heredity and lifestyle. 
Nor do they integrate the implications growing out of this prioritization with stress 
management and maintenance of habit changes. . 

By combining direct instruction in bs^ic facts and concepts with an explicit 
strategy for using a simulation, we were able to teach more advanced forms of 
knowledge in health. The explicit strategy enabled the mildly handicapped students 
to focus on the essential features of the simulation and not be misled by distracting 
information. We chose the computer simulation because it allowed us to dynamically 
display chatiges in the three main areas of each profile (i.e., prioritizing, stress 
management, and maintenance). Each change had repercussions on all other 
system variables, thus forcing the student to manipulate several factors at once. 
Although it might have been possible to demonstrate these kinds of interactions 
through other means (e.g., role playing, board games), we are convinced that a 
computer is the optimal medium for demonstrating dynamic change. 

It is tempting to try to infer too much from the results of the quasi-experimental 
component of our studies. A truer reflection of the combined effects of instructional 
design principles and technology would come from a series of aptitude treatment 
interaction studies. It is worth noting, however, that our studies do give us some 
Indication of the effect of these two variabfes on mastery of material. Results of the 
vocabulary study, for example, show that handicapped students achieved 
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performance levels comparable to non>handlcapped peers. This was a function, no 
doubt, of increased and efficient practice as well as direct instruction in the material. 
Furthermore, it is unlikely that the lack of difference between the two populations was 
due to some change in general intelligence or distractabllity. 

Data from the logic study are even more compelling. Significant differences on 
Total Test show a superior knowledge of \oq\c by the handicapped secondary 
students over college level education students. One might attribute this difference to 
the general difficulty of the skill. But it is the nonsignificsint differences between the 
college level logic students and the handicapped students that Is the most important 
comparison. . These data indk:ate that mildly handicapped-students can be taught 
complex material to a level that is comparable to older, non-handicapped students 
who have received different instruction over the same content. 

The mildly handicapped students who received the health simulation scored 
higher than both the handicapped control group and the non-hanc!capped peere 
from regular health classes on the problem solving measure. Admittedly, much of 
this difference can be attributed to specific instruction in this area; these skills are not 
a ready by-product of typical health instruction. However, these are desirable 
problem solving skills, as noted by three secondary health teachere who reviewed 
the meac ure. Furthermore, these skills reflect an above average level of 
competence. 

We base this comment on the performance of two non-handicapped students 
who were given the measure. Both students, rated by the teacher in a later 
discussion as being two of the best students in the class, had the highest scores on 
the Diagnosis Test. When asked by the researcher why they had completed the 
exercises as they did, each student articulated a set of strategies that were fully 
consistent with those contained in the explicit strategy. Thus, the handicapped 
students in the simulation group were taught to use strategies highly comparable to 
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ones used by the two non-handicapped students. Performance by these two 
fitudents reveals an integration of health knowledge that is at a higher level than 
most of their peers yet comparable to many of the handicapped students In the 
simulation group. 

Our results do not imply that a concentrated effort In instructional design and 
technology will erase all differences between mildly handicapped and non- 
handicapped students. There are simply too many other variables that account for 
the difference between the two groups. Instead, by pushing these two factors to the 
forefront, we are better able to explore the limits of education for the mildly 
handicapped in a far more precise manner. In doing this,.we come closer to<-. - 
enabling secondary mildly hanjicappsd students to meeting the basic requirements 
expected of their non-handicapped peers. 

Technology enables us to present certain kinds of instruction (e.g., the dynamic 
change in a health profile) in ways that we were incapable of doing in the past; the 
consequence being an inte^)ratlon of traditional and technology based curriculum. 
This point is critical. Software designers in the past have too often looked at 
technology based programs as stand alone products. Many times this has led to 
peculiar developments ~ to name formats that will hoki the students attention or to 
programs that are so broad (e.g.. LOGO, Rock/s Boots) that the Instructional goals 
are neither clear nor easily accomplished. A better understanding of the application 
of technology In special education is gradual^ emerging as it is linked to better 
instructiona! design principles. 
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Table! 



Comparison of Mildly Handicapped with Nonhandicapped Samples: 
Multiple Choice Vocabulaiy Test 



Mean 

Percent 

QiSm IfiSt D M SD Correct 

Small Teaching Set posttest 12 42.0 4.0 84.0 

Large Teaching Set posttest 12 43.7 7.7 37.4 
Nonhandicapped 

Comparison (10th grade) - 26 40.3 4.9 80.6 



Table 2 



PartloftheFonnalUgicT.* 

SfiMICa _ 80.07 

- 258 Jil 5 54 

Between Groups ^ mM 

Within Groups 1050.65 



146 

Total 
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Tables 

Part II of the Formal Logic Test 

SSMSS. D.F. Sum of Squares Mean Squares F Ratio p 

Between Groups 3 112.2 37.41 4.49 .005 

Within Groups 143 1190.18 8.32 

Total 146 1302.42 
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Tabled 



Means (M) and Swndard Dc/iauons (SD) forThrss Grouos 

U M SD 

Nimiiionand Disease Tesc 

Mildly Handcapped Studeac: 

Simuladon * * 15 22.00 3 ?"> 

Convendonal 15 TT.93 2S6 

Non-B!andcapped Students: 15 19!47 4^94 

Nutrition and Disease Tesc 
Items Reinforced bv 
Health Wav?; 

Mildly Handicapped StudeaK: 

Simuladon * 15 7,33 j 35 

Convendonal 15 5.60 '^'^Q 

Non-Kandicapped Suudenrs: 15 5o3 1*46 

Health Ways Diasnosis Tesc 
Total Score 

Mildly Handicapped Students: 

Simuladon 15 27.73 5 89 

Convendonal 15 12.47 4*88 

Non-Handicapped Students: 15 18^07 6 03 



FIGURE 1 



Test Words: Student 
Misses ^Abundant* 



Task: What does Abundar<t Mean? 

a) tied up 

b) plentiful 

c) to roam around 

d) scarce 



If 
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THE PRACTICE SET 



Store with other 
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Increased Delay 



Program Teacnes 
Abundant and 
Presents Task: 
Define Abundant 



no 



^ Define Define 
^ Abund ant J ^ Other Words 
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Define 
Abundant 



Increased Review 
on Abundant 



Figure 2a 



Problem: All incisors are teeth 
No teeth are muscles 



Tasic 1: What will be the first word in the conclusion? (all, some, no) 
Task 2: Write the conclusion on the line below. 
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Read Evidence Statements 



some 




all 



Rule : If an evidence statement begins with 
no, the conclusion begins with no. 



Write 'no' for tasIc 1 
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Rule : Conclusions beginning with no must be 
followed by the smallest class. 


\ 


f 


Rule : Complete the conclusion with the other 
class named once in the evidence statements. 


\ 


1 



Write 'No Incisors are Muscles' for Task 2 



Figure 2b 



Problem: 

No metals are plants 

All plants are living things 

No metals are living things 



Write the number below that best tells about this argument 
1. The argument is sound 

Z The conclusion does not name the smallest class 
3* Hie omclusion does not name the largest class 
4. The conclusion does not begin with the right word 



Read the evidence statements 




some 



Rule: If an evidence statement begins wiA NO, the conclusion 
begins widi NO 






Rule: Conclusions Beginning with NO must be followed by the 
smallest class 
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Goal: Increase the B^ected Age 
tttheOpdmalAge 
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